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INTRODUCTION 

The  broad  purpose  of  this  work  is  to  understand  the  genetic  basis  of  fracture  risk.  In 
order  to  accomplish  this  goal,  we  are  working  to  detect  and  identify  genes  whose  segregation 
alters  bone  biomechanical  performance  in  young  adult  mice.  Two  crosses  are  being  analyzed; 
the  first  is  an  intercross  of  recombinant  congenic  strains  HcB/13  X  HcB/14.  These  strains  each 
carry  1/8  of  their  genomes  from  C57BL/10ScSnA  and  the  remainder  from  C3H/DiSnA.  HcB/13 
and  HcB/14  are  highly  divergent  in  their  bone  properties,  but  only  — 1/4  of  the  genome  segregates 
in  this  cross,  allowing  analysis  of  epistatic  interactions  in  a  moderately  sized  cross.  The  second 
cross  is  between  outbred  B6C3/Fc-a/a-Cola2°'m/~  heterozygotes  and  B6C3/Fe-a/o  FI  animals. 
The  B6C3  background  is  closely  related  to  the  HcB/Dem  system.  In  both  cases,  bones  from  4 
month  animals  will  be  phenotyped  by  3-point  bend  testing,  radiographic  analysis,  ash 
percentage,  Fourier-transformed  infrared  spectroscopy,  and  histomorphometry.  Histograms  of 
the  trait  values  will  be  used  to  select  animals  for  use  in  linkage  mapping,  which  will  then  be 
carried  out  using  the  QTL  Cartographer  software  suite.  Linkage  mapping  of  distributed 
microsatellite  markers  will  be  supplemented  by  markers  isolated  through  representational 
difference  analysis  of  phenotypically  extreme  animals  from  each  cross.  QTLs  identified  in  both 
crosses  will  be  isolated  through  the  generation  of  congenic  lines  by  marker-assisted  selection. 
The  approach  taken  in  this  work  expands  on  earlier  investigations  of  bone  genetics  in  that 
biomechanical  performance  is  a  primary  endpoint  and  that  pleiotropy  testing  of  biomechanical 
outcomes  and  sub-phenotypes  relates  QTLs  to  specific  components  of  bone  strength. 
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BODY 

Statement  of  Work  Item  1:  Intercross  between  HcB/13  and  HcB/14 

Progress  on  this  item  was  delayed  approximately  6  months  due  to  issues  arising  from  my 
relocation  to  the  University  of  Wisconsin.  Some,  but  not  all  of  the  delay  has  been  made  up  since 
that  time. 

We  have  generated  and  phenotyped  the  13  X  14  and  14  X  13  FI  populations. 
Approximately  360  animals  were  generated  in  the  cross  and  sacrificed  at  17  +  1  weeks. 
Phenotyping  has  included  each  of  the  following: 

1 .  body  mass  at  sacrifice 

2.  ash  percentage  of  femora,  humeri,  and  radii  from  each  of  the  animals  (1  side  only) 

3.  femoral,  humeral,  and  radial  volumes  by  Archimedes’  principle  from  each  of  the  animals  (1 
side  only) 

4.  volumetric  BMDs  of  the  femora,  humeri,  and  radii  using  data  from  items  2  and  3  above  (1 
side  only) 

5.  areal  BMDs  of  the  femora,  humeri,  and  radii  using  the  PIXIMus  DXA  system  (GE  Lunar) 
(both  sides) 

6.  micro-CT  scans  of  femora,  humeri,  and  radii  using  the  micro-CAT  (ImTek,  Knoxville,  TN) 
(1  side  only) 

7.  3  point  bend  testing  of  femora,  humeri,  and  radii  (1  side  only). 

Analyses  of  these  data  is  in  progress.  This  aspect  of  the  work  is  proving  to  be  quite  time- 
consuming  due  to  the  sheer  quantity  of  information  accumulated,  and  unfortunately  is  still  not 
complete.  A  subset  of  the  phenotypic  data  have  been  analyzed  sufficiently  to  make  their 
presentation  informative.  These  are  briefly  summarized  below. 

Significant  sex,  cross  direction,  and  interaction  effects  on  body  mass  and  BMD  in  the  F2 
generation:  In  earlier  experiments,  we  noted  that  differences  in  biomechanical  performance 
between  HcB/13  and  HcB/14  can  be  accounted  for  largely  by  differences  in  bone  dimensions  and 
is  accompanied  by  similarly  large  differences  of  body  mass.  F2  animals  sacrificed  at  17+  I 
weeks  have  been  weighed  and  had  long  bones  examined  ex  vivo  by  DXA.  These  data  are  briefly 
summarized  below  in  table  1 . 
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Table  1.  Body  Masses  and  BMPs  of  HcB/13  X  HcB/14  F2  Animals 


13  X  14  Females 
N  =  96 

13  X  14 
Males 

N  =  102 

14  X  13  Females 
N  =  99 

14X13 

Males 

N  =  102 

Body  Mass  (g) 

21.4  +  2.7 

22.9  ±2.6 

20.0  +  2.8 

22.7  ±2.4 

Femoral  BMD 
(g/cm2) 

0.0642  ±0.0038 

0.0611  ±0.0037 

0.0616  ±0.0049 

0.0601  ±0.0033 

Humeral  BMD 
(g/cm2) 

0.0486  ±  0.0030 

0.0473  ±  0.0027 

0.0464  ±  0.0037 

0.0465  ±  0.0033 

Radial  BMD 
(g/cm2) 

0.0245  ±0.0014 

0.0246  ±0.0014 

0.0242  ±0.001 8 

0.0246  ±0.0015 

Values  in  cells  are  means  ±  standard  deviation.  BMDs  are  averages  of  right  and  left  bones  at 
each  site. 


The  body  masses  and  BMDs  of  the  F2  are  normally  distributed  within  arm  and  sex 
grouping  by  the  Kolmogorov-Smimov  test.  Comparison  among  the  4  groups  of  F2  by  2  way 
ANOVA  reveals  that  there  are  significant  differences  between  males  and  females  (P  <  0.001), 
between  13  X  14  and  14  X  13  arms  of  the  cross  (P  =  0.004),  and  a  significant  interaction 
between  these  2  factors  (P  =  0.028).  Femoral  BMD  shows  significant  effects  of  cross  arm  (P  =4 
x  10'5)  and  sex  (P  =  1  O'5),  but  no  interaction  between  these  factors.  At  the  humerus,  there  is  a 
significant  cross  arm  effect  (P  =  4  x  10'5),  but  no  sex  or  interaction  effect.  No  strain  or  sex 
effects  are  noted  at  the  radius.  Taken  together,  the  data  indicate  that  areal  BMD  is  greater  among 
13  X  14  F2  mice  than  14  X  13  F2  mice.  These  observations  illustrate  that  performing  a 
reciprocal  cross  not  only  provides  additional  linkage  mapping  information,  but  also  can  reveal 
additional  biology. 

Intersite  correlation  and  left-right  disparities  of  DXA  measurements:  These  observations  are 
summarized  in  2  appended  manuscripts.  The  first  manuscript,  now  in  press  at  the  Journal  of 
Clinical  Densitometry,  describes  performance  characteristics  of  ex  vivo  densitometry.  The 
second  manuscript  has  been  submitted  to  Calcified  Tissue  International  and  is  now  undergoing 
review. 
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Multiplexed  Micro-CT  scanning  of  long  bones:  In  order  to  incorporate  micro-CT  measurements 
of  bone  dimensions  into  biomechanical  testing  on  the  scale  of  a  several  hundred  animal 
intercross,  it  is  necessary  to  devise  a  scanning  protocol  that  allows  many  specimens  to  be 
scanned  simultaneously.  We  have  accomplished  this  by  positioning  bones  inside  the 
corrugations  of  corrugated  cardboard,  with  empty  spaces  in  the  lattice  left  to  allow  indexing  of 
the  positions  of  the  individual  bones  and  so  identify  each  specimen  unequivocally.  This  allows 
us  to  scan  -100  bones  at  once.  Figure  1  below  is  a  scout  view  of  such  an  array  and  figure  2  is 
an  arbitrary  cross  section. 


Figure  1 .  Scout  View  of  Micro-CT 


Row  3 


Row  2 


Reconstruction  of  the  correct  cross-sections  for  calculation  of  cross-sectional  area  (CSA)  and 
cross  sectional  moment  of  inertia  (/)  still  requires  additional  analysis  once  the  micro-CT  images 
have  been  acquired.  Slice  number  corresponding  to  the  point  of  force  application  must  be 
approximated  and  the  neutral  axis  assigned.  As  is  apparent  from  the  single  slice  illustrated  in 
figure  2,  the  mid-diaphyses  of  the  individual  specimens  are  not  aligned,  so  slice  selection  must 
be  done  for  each  bone,  as  illustrated  in  figure  3. 
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Figure  2.  Arbitrary  Micro-CT  Cross-Section,  Low  Resolution  Image 
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Femora, humeri,  and  radii  are  included  in  this  cross-section. 

Figure  3.  Determination  of  centroid,  and  I  in  an  Individual  Femoral  Diaphysis 
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Analysis  of  the  biomechanical  testing  data  to  determine  material  properties  depends  on 
images  such  as  that  at  the  right  of  figure  3.  For  each  bone  tested,  a  similar  reconstruction  is 
undertaken.  Our  image  analysis  software  allows  for  automated  threshold  assignment  of  each 
pixel  as  bone  or  non-bone  and  determination  of  the  centroid.  We  must  assign  the  neutral  axis 
manually.  The  software  then  calculates  1=  £(pixel  area)  X  (distance  of  pixel  from  neutral  axis)2 
over  all  bone  pixels.  This  represents  a  major  improvement  over  the  calculation  of  I  using  the 
elliptical  approximation  in  our  earlier  work. 

The  manually  performed  portions  of  the  image  analysis,  while  not  inherently  difficult,  are 
tedious  and  time  consuming.  We  have  not  completed  enough  of  this  work  to  be  able  to  include 
meaningful  summary  data  regarding  bone  dimensions.  Similarly,  we  analyze  biomechanical  data 
in  tandem  with  the  image  analysis,  so  these  data  are  also  largely  unanalyzed. 

Biomechanical  testing  of  long  bones:  Table  2  summarizes  whole  bone  properties  of  femora, 
including  heritabilities  of  the  whole  bone  mechanical  properties.  These  data  are  notable  in 
confirming  the  pilot  data  regarding  differences  in  failure  load  between  HcB/13  and  HcB/14  and 
in  showing  that  the  mechanical  properties  have  a  substantial  genetic  component.  Also,  as  for 
body  mass,  the  2  arms  of  the  cross  differ  biomechanically.  Findings  are  similar,  but  less  robust, 
at  the  humerus  and  radius  (not  shown). 


Table  2.  Femoral  Whole  Bone  Biomechanical  Performance 


Bl 

El 

HcB/13 

MBIEI 

HcB/14 

aygifei 

TO - fiff— 

FI  all 

IfWlikl 

BUmm 

FI  males 

■IiltM 

mmsssm 

FI  females 

iTCTKil 

6.53  ±1.35 

13  X  14  FI 

ibxmii 

14  X  13  FI 

— Mwa 

In»£ai:l 

F2  all 

0.50 

0.60 

■EEEHEIHI 

0.44 

F2  males 

0.52 

0.71 

6.31  ±1.61 

0.75 

F2  females 

■HMMH 

0.44 

IMPtikl 

0.49 

0.20 

13  X  14  F2 

■EtfriajM 

0.45 

0.80 

0.12 

14  X  13  F2 

0.64 

0.48 

wmmmm 

0.57 

Data  are  mean  ±  sd. 

Heritability  calculated  as  [variance(F2)-variance  (Fl)]/variance(F2) 

A  striking  feature  of  the  data  is  that  in  the  intercross  females  exhibit  superior 
biomechanical  performance  to  males.  This  appears  to  be  due  to  an  allele  or  combination  present 
in  HcB/14,  as  the  same  pattern  was  observed  in  the  HcB/14  parental  mice. 

Determination  of  tissue-level  mechanical  properties  is  still  in  progress,  as  this  requires 
prior  completion  of  the  micro-CT  determination  of  /for  each  bone.  As  noted  above,  this  task  has 
taken  far  longer  than  anticipated. 

Linkage  analysis  of  the  data  has  not  yet  been  performed  because  the  phenotypes  are  still 
being  analyzed.  This  will  be  accomplished  following  calculation  of  the  tissue-level  mechanical 
properties. 
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Statement  of  Work  Item  2:  Genetic  Characterization  of  CollaT‘m  +  heterozygotes 

We  have  completed  the  initial  genetic  analysis  of  this  cross.  We  used  composite  interval 
mapping  in  QTL  cartographer  to  interpret  the  data  as  an  F3  intercross.  We  chose  to  interpret  the 
cross  as  an  F3  because  the  oim  breeding  scheme  results  in  the  inheritance  of  1  F2  chromosome 
and  1  chromosome  that  has  a  chromosome  that  has  been  derived  from  an  advanced  intercross  of 
at  least  F3,  providing  more  opportunities  for  crossing  over  than  would  exist  in  an  F2  intercross. 

Major  findings  from  the  initial  linkage  analysis  are  the  presence  of  statistically  significant 
coincident  linkage  peaks  for  failure  load  and  structural  striffness  on  chromosomes  4  and  6,  for 
cross-sectional  moment  of  inertia  on  chromosomes  1,  6,  and  16,  and  for  body  mass  on 
chromosome  1 .  Data  for  the  chromosome  4  QTL  are  illustrated  below  in  Figure  4.  Regression 
of  failure  load  on  genotype  shows  that  there  is  a  significant  regression  of  P  =  7.84  X  1  O'4,  with 
no  evidence  of  dominance,  with  C3  alleles  favoring  higher  failure  load,  accounting  for 
approximately  6.7%  of  the  observed  variance.  This  locus  coincides  with  that  found  by  Robling 
et  al.  mediating  mechanoresponsiveness  in  wild  type  mice  [1]. 


Figure  4.  Linkage  Map  and  Genotype-Phenotype  Regression  for  Chromosome  4 


A.  Multipoint  linkage  showing  failure  load  (red)  and  cross-sectional  moment  of  inertia  (black) 
as  a  function  of  map  position  on  mouse  chromosome  4. 
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Regression,  Conf.  &  Pred. 


B.  Regression  of  failure  load  on  genotype.  Genotypes  are  coded  as  C3/C3  =  0,  C3/B6  =  1,  and 
B6/B6  =  2.  The  regression  has  P  =  7.84  x  10~4  and  R2  =  0.067. 

In  addition  to  the  linkage  data  showing  a  distal  chromosome  4  modifier  of  Colla20'm,  we 
also  isolated  a  marker  in  this  chromosomal  region  by  RDA.  The  finding  of  a  locus  in  this  region 
by  multiple  groups  working  independently,  in  quite  different  experimental  systems  provides 
evidence  that  an  important  skeletal  gene  is  present  in  this  chromosomal  region.  However,  it  is 
important  to  note  that  it  is  not  possible  to  determine  whether  the  region  harbors  a  single  gene  or 
multiple  linked  genes  affecting  bone  strength. 

Our  chromosome  6  linkage  peak  overlaps  linkage  peaks  identified  by  Rosen  et  al.  for 
BMD  and  IGF-1  levels  [2,  3]  and  by  Klein  et  al.  for  femoral  cross-sectional  area  [4].  As  for 
chromosome  4,  the  fact  that  multiple  groups  of  investigators  have  found  coincident  QTLs 
provides  evidence  that  these  are  biologically  important  chromosome  regions. 

We  have  chosen  to  undertake  a  2nd  linkage  analysis  of  the  oim/+  population  using  the 
SOLAR  linkage  package.  We  have  done  this  because,  as  that  population  is  not  a  clean 
intercross.  In  order  to  better  account  for  the  outbreeding  of  the  B6C3  a/a  background,  we  are 
reanalyzing  the  cross  using  the  available  pedigree  information.  Because  of  my  inexperience 
using  the  SOLAR  package,  this  too  has  taken  longer  than  expected.  However,  we  expect  to 
submit  a  final  manuscript  on  this  work  sometime  this  summer.  We  will  forward  the  final, 
accepted  manuscript  to  you  when  available. 

Statement  of  Work  Item  3:  Initiation  of  Breeding  to  Generate  Congenic  Lines 
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This  aspect  has  not  yet  begun,  as  it  is  contingent  on  mapping  data  from  statement  of  work 

item  1. 
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KEY  RESEARCH  ACCOMPLISHMENTS 

•  Completion  of  the  HcB/13  X  HcB/14  reciprocal  intercross. 

•  Comprehensive  phenotyping  of  the  F2  generation 

•  Completion  of  genotyping  and  preliminary  analysis  of  the  Colla20,m/+  animals 

•  Finding  that  modifiers  of  Colla20'm  largely  overlap  bone  QTLs  found  in  wild-type  mice 

•  Observation  of  cross  direction  effects  on  body  mass  and  biomechanical  performance 

•  Observation  of  greater  BMD  and  BMC  in  left  bones  at  3  long  bone  sites 

•  Determination  of  the  performance  characteristics  of  ex  vivo  mouse  bone  DXA 

•  Observation  of  a  tooth  phenotype  in  Colla2°"n/om  and  Colla2°‘m/+  mice 

•  Development  of  capabilities  in  nanoindentation,  including  use  of  this  testing  method  in 
unembedded  cortical  bone 

•  Demonstration  that  Young’s  modulus  is  greater  in  1 1-week  old  Colla2°'m/oim  mice  than  either 
their  heterozyogous  or  wild  type  littermates. 

•  Demonstration  that  Young’s  modulus  of  cortical  bone  is  dependent  on  orientation 

•  Demonstration  that  Young’s  modulus  of  cortical  bone  is  greater  in  LRP5HBM  mice  than  in 
their  nontransgenic  littermates 
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REPORTABLE  OUTCOMES 
Manuscripts 

The  preliminary  data  included  in  the  original  application  for  this  award  are  reported  in  a  paper 
published  in  The  Journal  of  Bom  and  Mineral  Research  [5].  It  is  included  as  an  appendix. 

An  invited  perspective  [6]  was  published  in  The  Journal  of  Bone  and  Mineral  Research  and  is 
included  as  an  appendix. 

An  original  manuscript  describing  the  inhibitory  effects  of  sulfate  on  in  vitro  mineralization  was 
published  in  2001  [7]  and  is  included  as  an  appendix.  While  not  a  component  of  the  approved 
Statement  of  Work,  the  investigation  was  greatly  facilitated  by  this  award. 


The  manuscript  describing  the  use  of  GC-clamps  as  an  aid  to  the  design  of  denaturing  high 
performance  liquid  chromatography  originally  submitted  to  Clinical  Chemistry  was  published  in 
Clinical  Medicine  and  Research  [8].  The  original  reviewers  requested  significant  shortening  the 
report  to  approximately  14  of  its  present  length  and  limitation  to  a  single  figure.  The  co-authors 
and  I  therefore  decided  to  submit  it  elsewhere  because  we  felt  that  the  requested  revisions  would 
have  compromised  our  message.  This  paper  is  included  as  an  appendix.  While  not  a  component 
of  the  approved  Statement  of  Work,  this  work  nevertheless  could  not  have  been  accomplished 
without  this  award. 

An  original  manuscript  describing  Fourier  Transform  Infrared  spectroscopic  data  from  the  HcB/8 
and  HcB/23  recombinant  congenic  strains  was  published  in  Connective  Tissue  Research  in  2003 
[9].  While  study  of  this  pair  of  strains  was  not  included  in  the  approved  Statement  of  Work,  the 
large  differences  in  the  calculated  tissue  strength  of  bones  from  these  strains  prompted  our 
choice  of  these  strains  for  initial  investigation  of  collagen  cross-link  maturity  and  crystallinity  in 
these  strains.  Support  provided  by  this  award  was  invaluable  in  performing  these  studies.  This 
paper  is  included  as  an  appendix. 

An  original  manuscript  reporting  the  accuracy  and  precision  of  ex  vivo  DXA  scanning  of  mouse 
long  bones  was  published  in  the  Journal  of  Clinical  Densitometry  [10].  The  major  findings 
include  the  presence  of  a  position  artifact  in  our  instrument,  resulting  in  the  need  to  adjust  data 
according  to  specimens’  placement.  Performance  is  poorer  than  for  intact  mice.  As  expected, 
performance  varies  inversely  with  bone  size,  with  radii  too  small  for  valid  data  to  be  obtainable. 
There  is  no  advantage  to  scanning  bones  submerged  in  water  over  scanning  them  in  air.  The 
paper  is  included  as  an  appendix. 

An  invited  perspective  on  skeletal  genetics  was  published  in  IBMS  BoneKEy  [11]  and  is 
included  as  an  appendix. 

An  original  manuscript  reporting  BMD  correlations  between  DXA  measurements  of  ex  vivo 
femora,  humeri,  and  radii  was  published  in  Calcified  Tissue  International  [12].  The  main 
findings  are  that  mice  from  our  intercross  have  higher  BMDs  on  the  left  than  on  the  right  side  at 
both  the  femur  and  humerus.  This  paper  is  included  as  an  appendix. 
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An  original  manuscript  reporting  the  obstacles  to  application  of  the  International  Society  for 
Clinical  Densitometry’s  lumbar  spine  DXA  interpretation  guidelines  was  published  in  the 
Journal  of  Bone  and  Mineral  Research  [  1 3],  This  clinical  work  developed  in  part  because  of  our 
earlier  work  with  DXA  scanning  in  mice. 

An  original  manuscript  describing  the  dental  phenotype  of  homozygous  and  heterozygous  mice 
harboring  the  Colla2°m  mutation  is  in  press  at  Bone  [14].  The  main  findings  are  that  the 
mutation  causes  reduction  of  pulp  chamber  size,  reduction  of  the  number  and  organization  of 
dental  tubules,  and  reduction  of  dentin  mineralization.  Proofs  of  this  paper  are  included  as  an 
appendix. 

An  original  manuscript  describing  the  tissue-level  biomechanical  properties  of  cortical  bone  in 
B6C3  a/a  mice  harboring  various  Colla2  genotypes  is  in  the  final  stages  of  preparation.  The 
main  findings  include  demonstration  of  age  and  genotype-dependent  effects  on  Young’s 
modulus.  An  additional  study  demonstrated  anisotropic  behavior  in  spite  of  nanoindentation’s 
insensitivity  to  specimen  orientation.  Finally,  the  manuscript  reports  use  of  nanoindentation 
using  a  novel  technique  in  which  specimens  are  not  embedded,  allowing  compliance  analysis  to 
be  performed.  A  draft  is  included  as  an  appendix. 

Abstracts 

“Bone  Strength  Variability  in  Mice  Heterozygous  for  the  Cola2°mi/+  Mutation”  was  presented  as 
a  poster  at  the  American  Society  for  Bone  and  Mineral  Research  2000  annual  meeting. 

“Emergence  of  Fracture-Resistant  Cola2°'m/o'm  Mice”  was  presented  as  a  poster  at  the  American 
Society  for  Bone  and  Mineral  Research  2000  annual  meeting. 

Representational  difference  analysis  comparison  of  fully  outbred  Cola2°m/om  homozygotes  and 
partially  inbred  Cola2oun/oim  homozygotes  was  presented  as  a  poster  at  The  American  Society  for 
Bone  and  Mineral  Research  2001  annual  meeting. 

Geometry  as  a  Heritable  Determinant  of  Bone  Strength  was  presented  as  a  poster  at  The 
American  Society  for  Bone  and  Mineral  Research  2001  annual  meeting. 

“Phenotypic  Variability  of  OI  Due  to  the  Colla2°’m  Mutation”  was  presented  orally  at  the  2001 
Osteogenesis  Imperfecta  Workshop. 

Genotypic  and  Phenotypic  Characterization  of  Mouse  Bone  was  presented  orally  at  the  Care  and 
Characterization  of  Genetically  Engineered  Mice  and  Comparative  Pathology  in  Functional 
Genomics  Co-Conferences  sponsored  by  the  NIH-OLAW  and  the  C.L.  Davis,  DVM  Foundation. 

“Collagen  Cross-Link  Maturity  and  Crystallinity  Indices  Differ  Markedly  in  Recombinant 
Congenic  Mice  Having  Divergent  Calculated  Tissue  Strength”  was  presented  as  a  poster  at  The 
American  Bone  and  Mineral  Research  2002  annual  meeting. 
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“Modifiers  of  Biomechanical  Performance  in  Osteogenesis  Imperfecta  Mice:  Use  of  Genomic 
Representational  Difference  Analysis  to  Identify  Linked  RFLP  Markers”  was  presented  orally  at 
the  2002  Central  Society  for  Clinical  Research  meeting. 

“Dual  Energy  X  Ray  Absorptiometry  of  Ex  Vivo  Mouse  Long  Bones:  Site  and  Side 
Comparisons”  was  presented  as  a  poster  at  the  American  Bone  and  Mineral  Research  2003 
annual  meeting. 

“Supermice:  Mapping  Putative  Modifiers  of  Mechanical  Performance  In  Colla2om 

Homozygotes  and  Heterozygotes”  was  presented  orally  at  the  2003  Osteogenesis  Imperfecta 
Workshop. 

iiColla2°"n/o,m  and  Colla2oim/+  Mice  Express  Dentinogenesis  Imperfecta”  was  submitted  to  the  8th 
International  Conference  on  the  Chemistry  and  Biology  of  Mineralized  Tissues.  A  second 
version  of  this  abstract  was  submitted  to  the  American  Society  for  Bone  and  Mineral  Research 
for  presentation  at  the  2004  annual  meeting.  Dr.  Lopez  received  a  young  investigator  award  for 
this  work  from  ICCBMT. 

“Young’s  Modulus  of  C57BL/6J  Cortical  Bone  Is  Less  Than  That  of  Outbred  Collal°'m 
Heterozygotes  or  Their  Wild-Type  Littermates”  was  presented  at  the  2004  ASBMR  annual 
meeting. 

“Assessment  of  Bone  Quality  at  the  Sub-micron  Scale  in  Chain  Deficiency  Osteogenesis 
Imperfecta”  was  presented  as  a  poster  at  the  2005  ASBMR/NIAMS  Bone  Quality  Meeting.  Dr. 
Lopez  received  a  young  investigator  award  for  this  presentation. 

Patents  and  Licenses 
None 

Degrees  Granted 

Gloria  E.  Lopez  Franco  is  expected  to  receive  an  MS  in  Bacteriology  in  May,  2004.  Fler  thesis 
described  the  oral  flora  in  HcB/13  and  HcB/14  mice. 

Cell  Lines  and  other  Biological  Reagents 
None 

Informatics  Resources  and  Models 

We  have  developed  a  MS-Access/VBA  utility  to  facilitate  mouse  colony  management.  This 
system  automatically  assigns  unique  names  to  mice,  keeps  a  calendar  of  when  husbandry  tasks 
must  be  accomplished,  and  facilitates  organization  of  specimens  and  genotypes. 

Other  Funding 

A  new  application,  entitled  “Genetics  of  Bone  Tissue  Material  Properties  in  Mice”  was 
submitted  to  the  Department  of  Veterans’  Affairs  Merit  Review  program  in  December,  2001. 
The  application  was  approved  for  funding  and  the  project  will  run  from  10/1/02-9/30/06  and  will 
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allow  a  second  intercross,  between  HcB/8  and  HcB/23  to  be  undertaken.  The  application  was 
modeled  on  the  USAMRAA  project. 

Two  grant  applications  are  pending.  One,  based  on  the  Colla20,m  work  supported  in  this 
application,  proposes  an  intercross  of  C57BL/6  CollalMov',3/+  X  FVB/N  is  assigned  to  NIAMS. 

A  second,  proposing  construction  of  congenic  strains  harboring  QTLs  identified  in  this  work  is 
under  review  at  USAMRMC. 

Two  undergraduate  students  working  in  my  laboratory  have  received  competitive  awards  based 
in  part  on  their  work  with  me.  Tyriina  K.  O’Neil  received  a  Morris  Udall  Scholarship  for  the 
2004-2005  academic  year  and  again  for  the  2005-2006  academic  year.  Jonathan  L.  Vu  received 
a  University  of  Wisconsin  McNair  Scholarship  for  the  2005-2006  academic  year. 

Employment  and  other  Research  Opportunities 

Work  on  this  project  has  resulted  in  the  formation  of  an  interdisciplinary  bone  research  group  at 
the  University  of  Wisconsin.  The  group  includes  physicists,  engineers,  and  biologists  and  meets 
approximately  monthly.  In  addition  to  9  independent  investigators  based  in  Madison,  the  group 
also  includes  Dr.  N.P.  Camacho  of  the  Hospital  for  Special  Surgery,  New  York,  NY.  The 
group’s  goal  is  to  determine  the  role  that  spatial  organization  of  bone  tissue  plays  in 
biomechanical  performance.  Several  new  grant  proposals,  including  two  currently  being 
reviewed,  are  planned. 

We  recently  completed  experiments  performed  in  collaboration  with  Drs.  Mohammed  Akhter, 
Diane  Cullen,  and  Robert  Recker  at  Creighton  University,  investigating  tissue  level 
biomechanics  in  mice  harboring  an  LRP5HBM  transgene  and  their  nontransgenic  littermates.  We 
found  that  modulus  is  greater  in  the  transgenic  animals.  An  abstract  reporting  these  findings  has 
been  submitted  for  the  2005  ASBMR  meeting. 
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CONCLUSIONS 

Data  generated  in  this  project  to  date  suggest  that  all  elements  of  the  statement  of  work 
will  ultimately  be  realized.  The  data  acquisition  has  essentially  been  completed  for  statement  of 
work  items  1  and  2.  Statement  of  work  item  3  has  been  deferred  because  of  the  impossibility  of 
undertaking  it  without  fuller  analysis  of  the  item  1  data. 

The  HcB/13  X  HcB/14  intercross,  representing  item  1,  will  ultimately  provide  the  ability 
to  seek  epistatic  interactions  between  pairs  of  genes,  as  originally  planned.  The  heritability  and 
phenotypic  effects  observed  in  the  cross  correspond  quite  well  with  predictions  made  at  the 
outset.  An  important  pattern  of  findings  in  the  phenotypic  analysis  of  the  cross  is  that  the  13  X 
14  and  14  X  13  arms  of  the  cross  differ  from  each  other.  Differences  between  cross  arms  are 
apparent  for  multiple  traits,  including  body  mass  and  biomechanical  performance.  These 
observations  suggest  that  there  may  be  important  loci  mediating  these  traits  on  the  X 
chromosome.  In  addition,  this  aspect  of  the  work  led  us  to  consider  application  of  DXA 
methodology  to  mice  in  some  detail.  Lastly,  we  have  observed  disparities  between  left  and  right 
long  bones  of  individual  animals.  This  observation  may  represent  the  skeletal  manifestation  of 
behaviorally  documented  lateralization  in  the  mouse. 

Mapping  modifiers  of  Colla20,m  has  provided  independent  evidence  supporting  the 
presence  of  important  skeletal  genes  on  chromosomes  4  and  6,  where  others  had  found  QTLs  for 
BMD  and  femoral  cross-sectional  area  in  crosses  of  wild  type  mice.  It  remains  an  open  question 
whether  additional  modifiers,  with  significance  limited  to  the  disease  state,  also  exist.  Our 
investigation  of  this  murine  OI  model  has  also  led  to  2  significant  developments  outside  this 
project’s  plan  of  work.  First,  we  have  characterized  a  dental  phenotype  in  mice  harboring  the 
Colla2om  mutation.  Second,  our  work  has  led  to  the  formation  of  an  accomplished 
interdisciplinary  group  to  study  the  relationship  between  bone  tissue’s  structural  organization 
and  bone  strength.  A  subset  of  the  interdisciplinary  group’s  objectives  are  being  pursued  in  a 
VA  Merit  award  arising  from  work  supported  by  the  USAMRMC  award.  Third,  we  have 
developed  nanoindentation  testing  as  an  additional  capability  in  our  laboratory. 

Work  on  this  project  has  been  behind  schedule  for  2  reasons.  First,  some  time  was  lost  as 
a  consequence  of  my  move  from  the  Hospital  for  Special  Surgery  to  the  University  of  Wisconsin. 
Second,  reconstruction  and  analysis  of  micro-CT  data  has  proven  far  more  labor-intensive  and 
time  consuming  than  anticipated. 
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Abstract 

Differentiating  chick  limb-bud  mesenchymal  cell  micro-mass  cultures  routinely  mineralize  in  the  presence  of  10%  fetal  calf  •  ’ 
serum,  antibiotics,  4  mM  inorganic  phosphate  (or  2.5  mM  p-glycerophosphate),  0.3  mg/ml  glutamine  and  either  25  pg/ml 
vitamin  C  or  5-12  jrg/ml  vitamin  C-sulfate.  The  failure  of  these  cultures  to  produce  a  mineralized  matrix  (assessed  by 
electron  microscopy,  45Ca  uptake  and  Fourier  transform  infrared  microscopy)  led  to  the  evaluation  of  each  of  these  additives. 
We  report  here,  that  the  ‘stable’  vitamin  C-sulfate  (ascorbic  acid-2-sulfate)  causes  increased  sulfate  incorporation  into  the 
cartilage  matrix.  Furthermore,  the  release  of  sulfate  from  the  vitamin  C  derivative  appears  to  be  responsible  for  the  inhibition 
of  mineral  deposition,  as  demonstrated  in  cultures  with  equimolar  amounts  of  vitamin  C  and  sodium  sulfate.  ©  2001  Elsevier 
Science  B.V./Intemational  Society  of  Matrix  Biology.  All  rights  reserved. 

Keywords:  Ascorbic  acid;  Ascorbic-acid-2  sulfate;  Vitamin.  C;  Mineralization;  Cell  culture 


1.  Introduction 

Ascorbic  acid  (vitamin  C)  is  an  obligatory  require¬ 
ment  in  cultures  of  differentiated  osteoblasts  (Dean 
et  al.,  1994;  Franceschi  et  al.,  1995;  Siggelkow  et  al, 
1999;  Torii  et  al.,  1996;  Tullberg-Reinert  and  Jundt, 
1999)  and  chondrocytes  (Boskey  et  al.,  1991b;  Hall, 
1981;  Leboy  et  al.,  1989;  Shapiro  et  al.,  1991;  Sullivan 
et  al.,  1994;  Venezian  et  al.,  1998)  where  it  is  a 
co-factor  for  collagen  hydroxylases  (Tschank  et  al., 
1994),  an  activator  of  alkaline  phosphatase  gene  ex¬ 
pression  (Leboy  et  al.,  1989;  Venezian  et  al.,  1998) 
and  a  stimulator  of  chondrocyte  maturation  (Leboy  et 
al.,  1997).  Because  vitamin  C  is  rapidly  oxidized  to 
form  toxic  ascorbyl  radicals  (Makino  et  al,  1999)  it 
must  be  prepared  fresh  before  each  charge  of  cell 
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culture  medium.  Vitamin  C  persists  in  cultured  tis¬ 
sues  for  approximately  24  h,  although  it  is  detectable 
in  tissue  culture  medium  for  only  2  h  (Roach  et  al., 
1985).  Addition  of  fresh  vitamin  C  is  not" possible  for 
cell  culture  experiments  carried  out  in  bioreactors  on 
board  the  space  shuttle  or  in  biosatellites.  To  cir¬ 
cumvent  this  problem,  following  preliminary  evalua¬ 
tion,  vitamin  C-sulfate,  a  reportedly  stable  derivative 
of  vitamin  C  (Machlin  et  al.,  1976;  Tolbert  et  al., 
1975),  was  used  in  chondrocyte  cultures  flown  on  the 
several  NASA  shuttle  missions  (Doty  et  al.,  1999).. 
Recently,  to  save  technician  time,  we  replaced  the 
addition  of  fresh  vitamin  C  with  vitamin  C-sulfate  for 
ground-based  laboratory  experiments.  We  report  here 
that  vitamin  C-sulfate  inhibits  matrix  mineralization 
of  differentiating  mesenchymal  cell  micro-mass  cul¬ 
tures  supplemented  with  inorganic  phosphate  or  p- 
glycerophosphate,  without  having  effects  on  cell  vi¬ 
ability,  overall  matrix  synthesis,  or  alkaline .  phos¬ 
phatase  activity. 
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2.  Materials  and  methods 

2.1.  Cell  culture  methodology 

Chick  limb-bud  mesenchymal  cells  were  isolated 
from  stage  21-24  (Hamburger  and  Hamilton,  1951) 
fertilized  White  Leghorn  ■  eggs  (Truslow  Farms,  :  ■ 
Chestertown,  MD)  as  described  in  detail  ,  elsewhere 
(Boskey  et  al.,  1992b).  The  eggs  were  maintained  in  a 
humidified  incubator  at  37°C  for  4.5  days.  The  em¬ 
bryos  were  then  sterilely  withdrawn  from  the  eggs  and 
their  limb  buds  removed  into  0.9%  USP  grade  saline 
(Abbott  Laboratories,  N  Chicago,  IL).  Cells,  released 
from  the  limb  buds  by  digestion  with  5  ml  0.25  wt.% 
trypsin-0.53  mM  EDTA  (GIBCO,  Grand  Island,  NY), 
were  separated  from  debris  by  passage  through  two 
layers  of  20-pm  Nitex  membrane  (Tetko  Inc.,  Axdsley, 
NY).  Cells  were  counted  with  a  hemocytometer, 
checked  for  viability  by  trypan  blue  dye  exclusion  and 
pelleted  in  the  cold  at  2300  rev/min.  In  all  cases, 
viability  was  greater  than  or  equal  to  97%.  Cells  were 
resuspended  in  medium  containing  1.3  mM  Ca  and 
plated  using  the  micro-mass  technique  (Ahrens  et  al., 
1977),  at  a  density  of  0.75  million  cells  per  20  pi  drop 
in  35  X  10  mm  Falcon  dishes  and  allowed  to  attach 
for  2  h  in  a  humidified  atmosphere  of  5%  C02  at 
37°C.  After  2  h,  Dulbecco’s  modified  essential  medium 
(DMEM, -GIBCO  Formula  80-0303A,  Grand  Island, 
NY)  which  initially  contained  1  mM  inorganic  phos¬ 
phate  and  0.3  mM  calcium,  was  adjusted  to  have  final 
concentrations  of  1.3  mM  calcium  chloride,  1000  mg/1 
glucose  and  50  units/ml  penicillin  and  25  pg/ml 
streptomycin,  10%  fetal  calf  serum  (GIBCO,  Grand 
Island,  NY)  and  0.3  mg/ml  glutamine.  Mineralizing 
cultures  were  further  supplemented  with  3  mM  inor¬ 
ganic  phosphate  from  day  2  onward,  making  the  total 
inorganic  phosphate  content  4  mM.  In  preparation 
for  a  NASA  space  flight,  the  cultures  were  maintained 
in  a  CellMax  Quad  artificial  capillary  cell  culture 
system  (Cellco,  Inc.,  Germantown,  MD),  consisting  of 
a  chamber  to  contain  cells  flushed  with  medium  at 
controlled  temperatures  and  pressure.  Medium  con¬ 
taining  2.5  mM  (3-glycerophosphate  was  recycled 
through  the  chambers  at  a  rate  of  6  ml/min.  Half  of 
the  chambers  received  25  pg/ml  vitamin  C,  prepared 
fresh  with  each  medium  change,  with  the  medium 
bottles  changed  every.  2  days.  The  other  half  received 
5  pg/ml  vitamin  C-sulfate  prepared  once  and  recy¬ 
cled  for  the  length  of  the  experiment.  Based  on  the 
results  of  that  study  (Fig.  1),  vitamin  C-sulfate  was 
used  for  NASA  experiments.  For  the  purpose  of  the 
study  reported  here,  the  vitamin  C  supplements  (de¬ 
scribed  below)  were  added  with  every  medium  change 
from  day  2.  Non-mineralizing  cultures  received  no 
phosphate  supplements  but  did  receive  the  ascorbate 
supplements.  Medium  was  changed  every  48  h.  Cul- 


Fig.  1.  45Ca  uptake  in  differentiating  chick-limb  bud  mesenchymal 
cell  cultures,  maintained  in  Cell-Max  cartridges  for  15  days,  in  the 
presence  of  recirculating  medium  with  (open  squares)  25  p.g/ml 
(0.28  pM)  vitamin  C  (AA)  changed  every  2  days,  or  (closed  circles) 
5  |ig/ml  vitamin  C-sulfate  (AAS,  0.05  p,M).  The  dotted  line  is  fitted 
to  both  sets  of  data.  No  significance  differences  were  detected  at 
any  time  point. 

tures  were  maintained  for  21  days  with  day  0  referring 
to  the  day  of  plating. 

2.2.  Vitamin  C  additives 

Vitamin  C  [sodium  ascorbate  (Sigma  Chemicals,  St 
Louis,  MO)],  vitamin  C-sulfate  [sodium  ascorbate-2- 
sulfate  (Sigma  Chemicals,  St.  Louis,  MO)]  stored  at 
— 20°C  as  per  manufacturers  directions)  or  vitamin  C 
plus  an  equal  molar  concentration  of  sodium  sulfate 
(Fisher  Chemicals,  Springfield,  NJ)  were  added  to 
cultures  in  concentrations  of  25,  12.5  or  5  p-g/ml. 
Vitamin  C  was  also  added  at  a  concentration  of  50 
pg/ml.  To  determine  whether  the  Vitamin  C-sulfate 
degraded  ‘on  the  shelf,  in  some  experiments  the 
reagent  which  was  acquired  in  1995  (old)  and  was  still 
valid  according  to  the  manufacturer,  was  used,  while 
in  others  the  vitamin  C-sulfate  was  used  within  1 
week  of  receipt  from  the  manufacturer  and  prepared 
fresh  with  each  change  of  medium  (fresh).  Another 
batch  of  cultures  received  this  same  vitamin  C-sulfate 
prepared  at  the  start  of  the  experiment,  but  not 
remade  for  each  change  of  medium  (new).  In  another 
set  of  experiments,  the  ‘new’  reagent  was  heated  at 
60°C  for  4  h  to  accelerate  any  breakdown.  Cultures 
prepared  from  the  same  cell  preparation  but  main¬ 
tained  with  the  different  vitamin  C  sources  were 
collected  for  analyses  on  days  5,  12,  16,  19  and  21. 
Cell  viability  and  the  ability,  .to  form  chondrocyte 
nodules  was  determined  on  day  5,  based  on  propid- 
ium  iodide  and  alcian  blue  staining,  respectively.  - 

2.3.  Matrix  and  cell  parameters 

Cell  and  matrix  morphology  and  mineral  localiza¬ 
tion  were  analyzed  by  transmission  electron  micro- 


A.L.  Boskey  et  al  /Matrix Biology  20  (2001)  99-106 


101 


scopy  (TEM).  Samples  were  washed  first  with  0.05  M 
cacodylate  buffer  and  then  fixed  in  the  culture  dish 
for  12-18  h  at  4°C  in  EM  fixative  (0.5%  glutaralde- 
hyde,  2%  paraformaldehyde,  0.05  M  pH  7.2  cacody¬ 
late  buffer).  After  removal  of  the  fixative,  samples 
were  stored  at  4°C  in  0.05  M  cacodylate  buffer  con¬ 
taining  7%  sucrose.  Fixed  cultures  were  removed  from 
the  dish,  post-fixed  with  2%  aqueous  osmium,  dehy¬ 
drated  in  a  graded  series  of  alcohols  and  embedded  in 
Spiirr’s  resin.  Thin  sections  were  collected  on  water 
containing  bromthymol  blue  as  an  indicator  for  pH 
above  8.0. to  prevent  mineral  dissolution.  Sections 
stained  with  lead  citrate  and  alcoholic  uranyl  acetate 
were  examined  on  a  Philips  CM12  electron  micro¬ 
scope  and  representative  micrographs  photographed 
for  presentation. 

Matrix  properties  in  cultures  treated  with  different 
supplements  were  measured  on  day  16.  Specifically, 
total  proteoglycan  content  was  determined  using  a 


modification  of  the  dimethylene  blue  staining  method 
(Famdale  et  al.,  1982),  with  relative  values  reported 
as  absorbance  at  595  nm.  To  determine  the  extent  of 
proteoglycan  sulfation,  parallel  dishes  labeled  at  day 
15  with  35S04  (1  uCi/ml)  for  16  h  were  subjected  to’ 
the  dimethylene  blue  staining  procedure  and  the  radi- 
olabelled  dimethylene  blue-proteoglycan  complex  ex¬ 
tracted  into  4  M  guanidine  hydrochloride  and  an 
aliquot  of  the  extract  subjected  to  scintillation  count¬ 
ing.  Data  was  expressed  as  35S04/proteoglycan  absor¬ 
bance.  Collagen  hydroxyproline  was  measured  in  0.1- 
mg  aliquots  of  lyophilized  mineralizing  day  16  cul¬ 
tures.  Cultures  were  hydrolyzed  in  6  N  HC1  in  vacuo, 
after  flushing  with  nitrogen,  at  110°C  for  24  h.  An 
aliquot  of  each  hydrolysate  was  then  analyzed  for 
hydroxyproline  on  a  Varian  HPLC  9012/9050  con¬ 
figured  as  an  amino  acid  analyzer  (AA911  column, 
Interaction)  with  ninhydrin  color  development  at 
135°C  monitored  at  440  nm  (Uzawa  et  al.,  1998).  Data 


Fig.  2.  Morphological  differences  between  in  vitro  cartilage  growth  in  the  presence  of  25  pg/ml  vitamin  C  (Fig.  2a, c)  and  12.5  pg/ml  vitamin 
C-sulfate  (Fig.  2b,d).  Fig.  2a, b  are  control,  non-mineralizing  cultures  and  figures  c  and  d  are  mineralizing  cultures,  (a)  The  typical  appearance’ 
with  vitamin  C  shows  chondrocytes  with  normal  cytology  (i.e.  endoplasmic  reticulum  (ER),  golgi,  mitochondria,  etc.)  and  significant  space 
between  each  cell  which  is  filled  with  collagen  fibers  and  proteoglycans,  (b)  These  chondrocytes,  grown  in  the  presence  of  12.5  pg/ml  vitamin 
C-sulfate  show  membrane  bound  cytoplasmic  inclusions  (arrows)  scattered  among  the  normal  cytoplasmic  organelles.  In  addition,  the  cells  are 
cldsely  positioned  to  each  other  because  of  reduced  matrix  volume  compared  to  the  standard  vitamin  C  treated  cells  shown  in -Fig.  2a.  (c) 
Typical  day  21  mineralizing  culture  with  vitamin  C  The  mineral  has  completely  covered  the  underlying  collagen  matrix.  A  calcific  deposit  has 
formed  in  the  cartilage  matrix  adjacent  to  a  chondrocyte  with  normal  looking  cytoplasmic  content,  (d)  Mineralizing  culture  treated  with 
vitamin  C-sulfate.  This,  day  21,  culture  shows  a  small  amount  of  electron  dense  material  (arrows)  deposited  along  some  of  the  collagen  fibers. 
Well-formed  calcific  deposits  were  not  found.  At  this  magnification,  membrane  bound  cytoplasmic  inclusions  (I)  within  the  chondrocytes  are 
noted. 
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were  expressed  as  nmoles  hydroxyproline  per  mg  ' dry 

culture  weight.  . - . r^--- 

Alkaline  phosphatase  activity  (Sigma  Test  Kit  104- 
LL,  Sigma'Diagnostics,  St  Louis,  MO)  was  monitored 
on  day  12.  The  cultures  were  washed  twice  with  0.9% 
sodium  chloride  and  digested  at  room  temperature 
for  30  min  in  a  buffer  containing.  0.15  M  pH  9.0-Tris, 
0.1  mM  magnesium chloride,  0.1  mM  zinc  chloride- 
and  1%  Triton.  The  cultures  were  broken  apart  with' a 
cell  scraper  and’  the  incubation  repeated  and  the 
activity  of  the  total  extract  measured  using  p-nitro- 
phenyl-phosphate  as  the  substrate.  .  . . . 

2.4.  Mineral  analyses 


The  accumulation  of  mineral  in  the  cultures  was 
assayed,  as  detailed  elsewhere  (Boskey  et  al.,  1992b, 
1996,  1997b)  based  on  45Ca  uptake  (expressed  per 
total  culture).  In  brief,  45  Ca  uptake  was  determined 
for  each  micromass  culture  spot,  following  hydrolysis 
of  the  spot  in  2  N  HC1  (2  h,  60°C)  and  scintillation 
counting.  Uptake  was  expressed  as  percentage  of  min¬ 
eralizing  control  cultures  at  day  21.  The  presence  of 
mineral  was  also  validated  by  Fourier  transform  in¬ 
frared  microspectroscopy,  FT-IR  (Boskey  et  al., 
1992a).  FTIR  imaging  and  microscopy  studies  were 
done  on  fresh .  cultures,  placed  and  air  dryed,  on 
barium  fluoride  IR  windows.  Spectra  were  recorded  at 
20-|im  resolution  using  a  BioRad  Infrared  Spec¬ 
trometer  (Cambridge,  MA)  with  an  MCT  detector. 

2.5.  Statistical  evaluation 

Each  experiment  was  performed  at  least  in  tripli¬ 
cate,  with  three-four  dishes  of  each  condition  for 
each  time  point  for  each  individual  experiments.  In 
each  independent  experiment,  the  average  for  each 
condition  at  each  time  point  was  calculated  and  these 
average  values  used  to  calculate  means  for  all  experi¬ 
ments.  Significant  differences  between  experimental 
and  control  conditions  were  evaluated  based  on 
ANOVA  and  the  appropriate  statistical  test.  A  P<, 
0.05  was  taken  as  significant. 


3.  Results 

Fig.  1  presents  the  45  Ca  uptake  in  the"  Cell-Max’  ' 
cartridges,  comparing  a  typical  culture  with  5  (xg/ml 
vitamin  C-sulfate  (made  once)  and  25  p.g/mT vitamin 
C  (prepared  fresh  every  other  day).  As  can  be  seen, 
there  was  no  detectable  difference. 

Fig.  2  presents  typical  micrographs  which  indicate 
that  the  appearance  of  the  cellular  morphology  and 
mineral  deposition  in  cultures  treated  with  12.5  (Xg/ml 
vitamin  C-sulfate  differs  from  those  treated  with  25 


[ig/ml  vitamin  C.  Although  the  appearance  of  the 
matrix  seems  normal  in  both  non-mineralizing  (Fig. 
2a, b)  and  mineralizing  cultures  (Fig.  2c, d),  in  all  mi¬ 
crographs  examined,  there  was  less  matrix  between., 
chondrocytes  following  vitamin  C-sulfate  treatment 
(Fig.  2b, d)  compared  to  vitamin  C  alone  (Fig.  2a, c). 
Also,  many,  of  the  chondrocytes  showed  inclusion 
bodies  (Fig.  2d)  suggesting  that  cell  function  is  not. 
completely  normal  after  exposure  to  vitamin  C-sulfate.- . 
The  appearance  of  the  collagen  and  proteoglycans, 
however,  was  not  noticeably  different. 

The  various  vitamin  C  preparations  also  did  not 
have  a  significant  effect  on  nodule  size  (not  shown), 
or  on  total  proteoglycan  content  (Fig.  3  a),  however, 
35S04  incorporation  into  the  proteoglycans  present 
was  significantly  increased  in  all  cultures  treated  with 
vitamin  C-sulfate  (Fig.  3b).  Furthermore,  the  extent  of 
the  increase  was  greater  with  the  higher  doses  of 
vitamin  C-sulfate.  35S04  uptake  could  not  be  mea¬ 
sured  in  cultures  treated  with  vitamin  C  and  sodium 
sulfate,  because  of  the  relative  concentrations  of  cold 
and  radiolabelled  sulfate.  Alkaline  phosphatase  activ¬ 
ity  (Fig.  3c)  was  not  altered  by  the  various  vitamin  C 
treatments  at  day  12. 

The  hydroxyproline  content  of  the  matrix  of  the 
mineralizing  cultures  was  slightly  but  not  significantly 
reduced  in  cultures  treated  with  vitamin  C-sulfate  as 
contrasted  with  cultures  given  vitamiri  C.  The  three 
concentrations  of  vitamin  C  used  (12.5,  25  and  50 
(xg/ml)  yielded  the  same  hydroxyproline  content  and 
there  was  not  a  dose-dependent  effect  of  vitamin 
C-sulfate  (Fig.  4)  on  hydroxyproline  content.  The  min¬ 
eralizing  matrices  of  the  cultures  treated  with  vitamin 
C-sulfate  or  vitamin  C  and  sodium  sulfate  had  a  lower 
mass  at  day  16  (3.05  ±0.38  mg)  as  contrasted  to 
5.08  ±  1.1  mg  for  the  cultures  treated  with  ascorbate 
(P  <  0.003,  Welch  non-parametric  f-test). 

The  failure  to  mineralize  in  the  cultures  given  the 
sulfated  form  of  vitamin  C  or  vitamin  C  +  Na2S04 
was  evidenced  by  the  decreased  45  Ca  uptake  (Fig.  5) 
as  well  as  the  absence  of  mineral  in  FT-IR  spectra 
(not  shown).  The  data  in  Fig.  5  show  an  average  of  the 
data  for  each  of  the  concentrations  tested,  as  the 
uptake  in  cultures  with  5  or  12.5  |xg/ml  vitamin 
C-sulfate  did  not  differ  by  more  than  5%  and  the 
uptake  in  cultures  with  12.5, 25  and  50  |xg/ml  vitamin 
C  agreed  to  within  2%.  Cultures  given  ‘fresh’  vitamin 
C-sulfate,  from  a  new  bottle  and  prepared  with  each’ 
medium  change,  were  most  similar  to  control  cul¬ 
tures,  but  45Ca  uptake  was  highly  variable.  Cultures 
which  received  the  same  (‘new’)  vitamin-C  sulfate 
solution,  prepared  only  at  the  start  of  the  experiment, 
showed  a  decrease  in  calcium  accumulation.  Cultures 
given  the  old  vitamin  C-sulfate,  heated  vitamin  C- 
sulfate,  or  fresh  vitamin  C  plus  sodium  sulfate  showed 
no  mineral  accretion.  Furthermore,  45  Ca  uptake  in 


R-  the  non-mineralizing  vitamin  C  supplemented  cul- 
’  tures  -was  relatively  constant,  whereas;  in  the  vitamin 
.  C-sulfate  1  supplemented  •  cultures,  45Ca  uptake  in- 
~ '  creased  100%  within  10  days  and  the  rate  of  calcium 
■V-  ■  u^take: whs  3  X  that  hi_  the  vitamin  C  treated  cultures 
{•~  (data  not  shown).  I-  "!.’  .  .1..  . 


4.  Discussion  •  •  -  •• 

This  study  was  not  undertaken  to  investigate  the 
mechanism  "  of  action'  of  vitamin  C  as  compared  to 
vitamin' C-sulfate;  which  has  been  described  in  detail 
elsewhere  (Toibert  et  al.,  1975),  but  rather  to  de- 
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Fig.  4.  Effect  of  vitamin  C  additives  on  hydroxyproline  content  of  the  culture  matrix.  Hydroxyproline  cpntent  was  measured  ion  day 16  fa 
mineralizing  cultures  supplemented  with  5  or  12.5  pg/ml  vitamin  C-sulfate  (AAS)  from  an  old  bottle  (old),  vitamin  C-sulfate  prepared  wth 
each  medium  change  (fresh),  vitamin  C-sulfate  from  a  new  bottle,  but  not  prepared  fresh  for  each  medmm  change  (new),  vitamin  C-sulfate 
that  was  heated  to  promote  breakdown  (heated)  and  vitamin  C  with  equimolar  amounts  of  sodium  sulfate  (AA  +  S04).  yv  P 
vitamin  C  (AA)  was  used  in  concentrations  of  12.5,  25  and  50  pg/ml  and  results  are  shown  as  mean  for  all  concentrations  (n  =  9). 


termine  why  cultures  which  had  a  reproducible  pat¬ 
tern  of  mineral  deposition  for  many  years  (e.g.  Boskey 
et  aL,  1991b,  1992a,b,  1996, 1997b)  stopped  mineraliz¬ 
ing  when  for  the  purpose  of  efficiency,  our  laboratory 
switched  from  using  vitamin  C  to  vitamin  C-sulfate. 

Our  initial  studies  had  shown  vitamin  C-sulfate  at  ' 
concentrations  less  than  12.5  p-g/ml  were  not  toxic  in 
these  cultures  and  that  was  confirmed  here.  Vitamin 
C-sulfate,  which  was  not  prepared  fresh  with  each 


change  of  medium,  however,  or  was  not  newly  ac¬ 
quired  from  the  manufacturer,  inhibited  mineraliza¬ 
tion.  Since  there  is  no  advantage  to  using  the  sulfated 
form  if  it  has  to  be  prepared  with  each  medium 
change,  there  appears  to  be  no  rationale  for  the  use 
of  this  ‘stable’  form  in  this  type  of  culture.  A  vitamin 
C-2-phosphate  that  does  not  form  ascorbyl  radicals  is 
also  available  (Makino  et  al.,.  19,99)  and  this  stable 
vitamin  C  derivative  has  been  used  in  several  os- 


Fig.  5.  Effect  of  vitamin  C  supplements  on  mineral  accumulation  in  differentiating  mesenchymahceU 

prepared  vitamin  C  (AA)  in  concentrations  of  12.5,  25  and  50  p/ml,  or  5  or  12.5  p-g/ml  vitamin  C-sulfate  (AAS)  from  an  old  bottle  (old), 
52? c“  prepared  with  each  medium  change  (fresh),  vitamin  C-sulfate  from  a  new  bottle,  but  not  prepared  fresh  with  each  medium 
change  (new),  vitamin  C-sulfate  that  was  heated  to  promote  breakdown  (heated)  and  vitamin  C with  equimoto  Mounteo 
(AA+  S04).  Absolute  «Ca  uptake  has  been  normalized  to  uptake  at  21  days  in  mineralizing  cultures  (4P  +  AA),  values  are  memi  ±  SX)  for 
three-six  cultures  per  time  point.  *  Significantly  different  from  same  time  point  in  mineralizing  cultures  with  25  pg/mlAA  cultures,  P  <  0.05. 


A.L.  Boskey  et  aL  / Matrix  Biology  20  (2001)  99-106 


teoblast  (Gronthos  et  al.,  1994;  Hitomi  et  al.,  1992; 
Jaiswal  et  al.,  1997;  Tullberg-Reinert  and  Juhdt,  1999; 
Torii  et  al.,  1994)  and  other  connective  tissue  and 
muscle  (Fermor  et  al.,  1998;  Mitsumoto  et  al.,  1994) 
culture  systems.  Although  tljis  phosphoiylated-deriva- 
tive  might  have  been  preferable  in  the  mineralizing 
culture,  it  would  have  slightly  increased  local  phos¬ 
phate  concentrations  in  the  non-mineralizing  controls 
which  are  very  sensitive  to  changes  in  phosphate 
content  (Boskey  et  al.,  1992b)  and  thus  we  did  not 
investigate  its  use. 

There  are  several  possible  explanations  for  the 
observed  effects  of  vitamin  C-sulfate  on  development 
of  a  mineralized  matrix.  Vitamin  C  (ascorbic  acid)  is 
oxidized  generating  toxic  ascorbyl  radicals  and  perox¬ 
ides  (Wilgus  and  Roskoski,  1988)  and  we  have  previ¬ 
ously  shown  in  this  system  that  high  vitamin  C  doses 
cause  demonstrable  cell  death  (Boskey  et  al.,  1991b). 
The  vitamin  C-sulfate  concentrations  used  in  our  cul¬ 
ture  system,  however,  are  far  below  this  value.  With 
the  lower  concentrations  of  vitamin  C-sulfate  used  in 
these  cultures,  there  was  no  EM  evidence  of  cell 
death  and  the  vitamin-C  concentrations  were  1/8 
that  previously  shown  to  cause  cell  necrosis  in  this 
system.  Thus,  while  viable  cells  are  required  for  carti¬ 
lage  calcification  (Boskey  et  al.,  1996),  the  failure  to 
mineralize  does  not  appear  to  be  due  to  cell  death. 
The  use  of  vitamin  C-sulfate  did  Slightly  (but  not 
significantly)  decrease  the  extent  of  prolylhydroxyla- 
tion,  suggesting  that  collagen  production  was  not  ex¬ 
tensively  altered.  The  lack  of  significance  may  be 
related  to  the  decreased  weight  of  cultures  treated 
with  sulfated  vitamin  C,  however,  the  ~  40%  differ¬ 
ence  in  weight  is  most  likely  attributable  to  the  pres¬ 
ence  of  mineral  in  the  vitamin  C  treated  cultures  and 
not  to  different  amounts  of  collagen  being  present. 

Vitamin  C-sulfate  is  a  substrate  for  sulfatases 
(Fluharty  et  al.,  1976;  Hatanaka  and  Egami,  1976; 
Roy,  1979)  that  are  abundant  in  cartilage  (Schwartz 
and  Adamy,  1976)  and  most  likely  in  this  culture 
system.  In  cartilage,  the  proteins  sulfated  are  most 
likely  the  proteoglycans,  which'  are  known  inhibitors 
of  calcification  both  in  solution  (Boskey,  1989;  Boskey 
et  al.,  1997a),  in  animal  models  (Boskey  et  al.,  1991a) 
and  in  this  culture  system  (Boskey  et  al,  1997b).  Total 
proteoglycan  content  was  not  decreased  in  the  pres¬ 
ence  of  vitamin  C-sulfate,  however,  sulfate  incorpora¬ 
tion  into :  total  proteoglycans  was  significantly  in¬ 
creased,  suggesting  that  increased  Ca-chelation  by  the 
over-sulfated  proteoglycans  is  the  major  factor  caus¬ 
ing  decreased  mineralization. 

.Shapiro  and  Poon  (1975)  noted  that  the  sulfate 
transfer  from  vitamin  C-sulfate  to  chondroitin  sulfate 
proteoglycans  in  chondrocyte  cultures  did  not  occur 
by  direct  transfer,  but  was  accounted  for  by  decompo¬ 
sition  during  storage  at  -20°C  for  several  days.  It  is, 
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therefore,  likely  that  in  the  present  study,  the  decom¬ 
position  of  the  vitamin  C-sulfate  results  in  increased 
sulfation  of  proteoglycans  (Brand  et  al.,  1989;  Silbert 
and  DeLuca,  1969;  van  der  Kraan  et  al.,  1988),  hence, 
the  high 45  Ca  uptake  in  the  non-mineralizing  controls, 
reflecting  Ca  chelation  by  the  matrix.  Calcium  chela¬ 
tion  by  the  more  highly  sulfated  proteoglycans  could 
contribute  to  the  failure  of  those  cultures  to  support 
mineralization.  Because  it  is  tempting  to  use  a  stable 
as  contrasted  with  an  unstable  essential  additive,  it  is 
important  to  consider  the  consequences  of  a  seem¬ 
ingly  innocuous  medium  supplement  such  as  vitamin 
C-sulfate  on  culture  behavior.  • 
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ABSTRACT 

Fracture  susceptibility  depends  jointly  on  bone  mineral  content  (BMC),  gross  bone  anatomy,  and  bone 
microarchitecture  and  quality.  Overall,  it  has  been  estimated  that  50-70%  of  bone  strength  is  determined 
genetically.  Because  of  the  difficulty  of  performing  studies  of  the  genetics  of  bone  strength  in  humans,  we  have 
used  the  HcB/Dem  series  of  recombinant  congenic  (RC)  mice  to  investigate  this  phenotype.  We  performed  a 
comprehensive  phenotypic  analysis  of  the  HcB/Dem  strains  including  morphological  analysis  of  long  bones, 
measurement  of  ash  percentage,  and  biomechanical  testing.  Body  mass,  ash  percentage,  and  moment  of  inertia 
each  correlated  moderately  but  imperfectly  with  biomechanical  performance.  Several  chromosome  regions,  on 
chromosomes  1,  2,  8,  10,  11,  and  12,  show  sufficient  evidence  of  linkage  to  warrant  closer  examination  in 
further  crosses.  These  studies  support  the  view  that  mineral  content,  diaphyseal  diameter,  and  additional 
nonmineral  material  properties  contributing  to  overall  bone  strength  are  controlled  by  distinct  sets  of  genes. 
-Morebver,  the  mapping  data  are  consistent  with  the  existence  of  pleiotropic  loci  for  bone  strength-related 
phendiy^ES.  These  findings  show  the  importance  of  factors  other  than  mineral  content  in  determining  skeletal 
”  performance  and  that  these  factors  can  be  dissected  genetically.  (J  Bone  Miner  Res  2001;16:992-1003) 

Key  words:  skeleton,  fractures,  biomechanics,  linkage  genetics,  quantitative  trait 


INTRODUCTION 

Fractures  occurring  as  a  consequence  of  skeletal  fra¬ 
gility  are  an  important  health  problem!1'21  Yet,  rela¬ 
tively  little  is  known  regarding  the  determination  of  bone 
strength.  Most  investigations  have  focused  on  bone  mineral 
density  (BMD),  because  this  is  a  phenotype  that  is  easily 
and  reproducibly  measured'11  and  for  which  pharmacologic 
interventions  are  available!21  However,  although  BMD  pos¬ 
sesses  some  predictive  value  with  regard  to  human  fracture 


■  risk,  other  risk  factors  have  been  identified,  including  past 
fracture  history,  maternal  fracture  history,  visual  acuity, 
height,  level  of  physical  activity,  general  health  status,  and 
use  of  steroids  or  anticonvulsants!31  From  epidemiological 
surveys,  some  of  these  additional  risk  factors  appear  to  be 
comparable  in  importance  with  BMD  but  have  not  been 
studied  in  nearly  as  great  detail. 

Although  the  morbidity  and  mortality  arising  from  skel¬ 
etal  fragility  are  suffered  predominantly  by  the  elderly, 
many  aspects  of  bone  strength  are  determined  early  in 
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(4)  Bone  mass  peaks  at  approximately  age  30  years,  with 
life  mineral  loss  occurring  over  the  remainder  of  the  life 
b0T1]  Both  peak  bone  mass  achieved  and  rate  at  which  bone 
loss  occurs  are  complex  traits,  determined  by  interactions 
between  genetic  constitution  and  environmental  factors. 
Overall,  it  has  been  estimated  that  50-70%  of  bone  strength 
is  determined  genetically/5’6-155 

Because  of  the  difficulty  in  performing  studies  of  the 
genetics  of  bone  strength  in  humans,  we  have  used  the 
HcB/Dem  series  of  recombinant  congenic  (RC)  mice  to 
investigate  this  phenotype.  The  RC  mice  used  in  this  report 
are  inbred  lines  descended  from  third  generation  backcross 
brother-sister  pairs  in  which  strain  C3H/DiSnA  (C3H)  was 
the  background  progenitor  and  C57BL/ScSnA  (BIO)  was 
the  donor  progenitor.0.6-185  Each  of  the  27  HcB/Dem  strains 
carries  an  average  of  12.5%  BIO  genome,  but  each  has  a 
different  combination  of  donor  and  recipient  strain  alleles. 
The  chromosome  regions  derived  from  each  progenitor  can 
be  determined  by  genotyping  for  markers  distributed 
throughout  the  genome.  Because  they  are  inbred,  the  results 
of  mapping  apply  to  all  members  of  the  strain  and  are 
cumulative.  Phenotyping  RC  strains  allows  one  to  include 
multiple  individuals  as  replicates,  allowing  generation  of 
sample  sizes  that  are  sufficiently  large  to  establish  the 
presence  of  small  differences  between  individual  strains, 
just  as  is  true  of  any  inbred  strain. 

The  experiments  reported  here  were  performed  to  achieve 
three  objectives.  First,  we  sought  to  describe  the  long  bone 
properties  of  C3H,  BIO,  and  24  of  the  HcB/Dem  strains. 
Failure  load,  structural  stiffness,  failure  stress,  and  modulus 
,  were  determined  through  three-point  bend  tests  of  the  ani¬ 
mals1'  left  humeri.  Middiaphyseal  inner  diameter  (ID)  and 
outer  'diijjpeter  (OD)  cross-sectional  area  (CSA),  and  mo¬ 
ment  6f  inertia  (/)  were  determined  through  image  analysis 
‘of  radiographs..  Ash  percentage  of  femora  from  the  same 
animals  was  determined  gravimetrically.  Body  mass  and 
body  mass  index  (BMI)  were  determined  by  weighing  and 
measuring  animals  at  death.  Second,  these  phenotypic  data 
and  the  published  HcB/Dem  genotypic  data09,205  were  used 
to  perform  quantitative  trait  locus  (QTL)  linkage  mapping 
of  the  genes  that  contribute  to  the  measured  phenotypes. 
Third,  we  investigated  the  interrelationships  among  the  var¬ 
ious  bone  properties  by  developing  multivariate  regression 
models  of  failure  load  and  by  examining  patterns  of  pleio- 
tropy  among  the  QTLs  discovered  by  linkage  analysis.  The 
ability  to  define  components  of  biomechanical  performance 
and  map  them  individually  contributes  to  a  deeper  under¬ 
standing  of  how  genetic  factors  determine  bone  strength 
than  would  the  study  of  any  single  parameter.  This  com¬ 
prehensive  approach  allows  simultaneous  estimation  of  the 
degree  to  which  individual  parameters  share  common  ge¬ 
netic  determinants  and  the  contribution  of  each  identified 
genomic  region  to  biomechanical  performance. 
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N3  backcross  animals.06-185  The  HcB/Dem  strains  have 
been  genotyped  at  130  marker  loci  distributed  over  each  of 
the  autosomes.119,205  Because  they  are  inbred,  individuals 
from  a  single  strain  have  the  same  genetic  composition, 
except  for  new  mutations  and  residual  unfixed  chromosome 
segments. (2t,22)  Less  than  5%  of  the  genome  was  unfixed  at 
tire  time  of  genotyping;  residual  heterozygosity  is  expected 
to  be  reduced  by  half  in  each  generation  of  inbreeding.  The 
mice  described  in  this  report  were  provided  by  Dr.  Peter 
Demant  and  maintained  at  the  Hospital  for  Special  Surgery 
until  6-7  months  of  age  under  12  h  light-dark  cycling  and 
fed  irradiated  PICO  5058  rodent  chow  (Purina,  St.  Louis, 
MO,  USA)  and  autoclaved  tap  water  ad  lib.  Beamer  and 
colleagues  have  reported  that  although  mice  continue  to 
grow  over  the  lifespan,  peak  bone  mass  is  achieved  at  4 
months  of  age/235  We  examined  strains  HcB/1  through 
HcB/9,  HcBll  through  HcB/15,  HcB/17,  HcB/18,  HcB/20 
through  HcB/23,  HcB/25,  HcB/26,  HcB/28,  and  HcB/29. 
HcB/16  is  extinct  and  HcB/27  is  an  unassigned  number;  so 
HcB/10,  HcB/19,  and  HcB/24  were  unexamined  because 
these  strains  were  unavailable  at  the  time  the  experiments 
described  here  were  performed.  Between  4  and  10  animals 
of  each  strain  were  studied,  yielding  179  HcB/Dem  mice. 
Animals  were  allowed  ad  lib  activity  and  were  housed  1-5 
animals/cage.  Only  females  were  studied,  because  inclusion 
of  males  would  have  introduced  sex-dependent  variability 
in  the  traits  in  addition  to  the  strain-specific  and  environ¬ 
mental  variability  already  encountered.  At  death,  body  mass 
and  rostroanal  length  were  measured.  This  work  satisfied 
The  Hospital  for  Special  Surgery’s  requirements  for  the 
ethical  use  of  laboratory  research  animals. 

Ash  percentage 

Bone  mineral  fraction  was  calculated  by  comparison  of 
dry,  defatted  bone  weight  to  ash  weight  of  homogenized 
tissue/24,255  We  chose  to  use  entire  bones  rather  than  bones 
from  which  the  epiphyses  and  marrow  have  been  removed 
because  tire  former  technique  is  more  reproducible  in  our 
hands.  No  ash  data  were  obtained  from  the  HcB/6  strain 
because  of  a  laboratory  accident. . 

Radiographic  analysis 

Image  analysis  of  fine  focus  contact  radiographs  of  dis¬ 
sected  humeri  was  performed  as'  described  and  used  to 
calculate  CSA  and  //25)  Humeral  length  was  defined  as  the 
distance  along  the  diaphysis  from  tire  trochlea  to  the  hu¬ 
meral  head’s  most  distant  point.  ODs  and  IDs  were  mea¬ 
sured  in  orthogonal  projections  just  distal  to  the  deltoid 
tuberosity,  perpendicular  to  the  diaphyseal  axis.  CSA  was 
calculated  according  to  the  elliptical  approximation/255 

CSA  =  t t(MLOR*APOR  -  ML1R*  AP  IR), 


MATERIALS  AND  METHODS 

Mice 

The  HcB/Dem  strains  were  established  and  are  main- 
temed  at  the  Netherlands  Cancer  Institute.  Briefly,  the  HcB/ 
em  mice  are  inbred  strains  derived  from  arbitrary  pairs  of 


where  OR  is  the  outer  radius  and  IR  is  tire  inner  radius  in 
either  the  mediolateral  (ML)  or  anteroposterior  (AP)  pro¬ 
jection.  I  also  was  calculated  according  to  the  elliptical 
approximation1265 

/  =  7t/4[(ML  OR)3(AP  OR  -  (ML  IR)3(AP  IR)]. 
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Image  analysis  was  performed  with  SigmaScan  (Jandel 
Scientific)  image  analysis  software.  All  radiographs  in¬ 
cluded  stepped  aluminum  densitometric  phantoms.  Radio- 
graphic  images  were  digitized  with  a  digital  camera  (Kodak, 
Rochester,  NY,  USA),  with  the  photographic  field  including 
a  length  scale. 

Biomechanical  testing 

Quasi-static  three-point  bend  testing  was  performed  on 
left  humeri  using  posts  designed  and  machined  in-house 
with  the  MTS  apparatus  (MTS  Systems  Corp.,  Eden  Prairie, 
MN,  USA)  and  Instron  (Canton,  MA,  USA)  electronics  as 
described. (27-28)  Humeri  were  oriented  with  the  deltoid  tu¬ 
berosity  downward  and  the  specimens  were  oriented  with 
the  central  post  adjacent  to  the  distal  end  of  the  deltoid 
tuberosity.  This  orientation  corresponds  to  the  ML  axis 
being  parallel  to  the  applied  force.  Posts  were  separated  by 
3.75  mm  except  for  the  BIO  specimens,  which  were  tested 
at  a  post  separation  of  3.0  mm. 

Biomechanical  data  were  analyzed  following  several  im¬ 
portant  assumptions.  First,  we  assumed  that  bone  strength  is 
determined  entirely  by  the  cortical  bone  in  the  middiaphy- 
sis.  Second,  we  assumed  that  the  humeral  diaphysis  is  an 
ellipse  with  its  major  axis  lying  in  the  ML  plane  and  its 
minor  axis  lying  in  the  AP  axis.  Calculated  biomechanical 
parameters  were  obtained  according  to  the  following  stan- 
■  dard  formulas  for  three-point  bending  of  ellipses065:  stress 
(<r,  MPa)  =  FLdAl,  where  F  is  force,  L,  is  length,  and  c  is 
ML  OR;  strain  (e;  mm/mm)  =  \2cdR\ttv1,  where  c  is  ML 

•  ■  OR,  d  is  displacement,  and  L  is  length;  and  Young’s  Mod- 
k  ulus  (E;  MPa)  =  (F/day)(L3/48/). 

.  Linkage  analysis 

Linkage  mapping  was  carried  put  using  the  QTL  cartog¬ 
rapher  software  suite,09'305  using  sib-recombinant  inbred  as 
the  cross  type.  The  breeding  scheme  of  RC  mice  is  equiv¬ 
alent  to  that  for  recombinant  inbred  mice  except  for  the  fact 
that  full-sib  matings  are  started  after  the  third  backcross 
rather  than  the  second  filial  generation.  This  breeding 
scheme  results  in  multiple  opportunities  for  recombination 
to  occur  before  fixation  of  tire  genotype.  Consequently, 
linkage  relationships  are  weakened  according  to  the  rela¬ 
tionship  R  ~  4r/(l  4-  6r),  where  R  is  the  observed  fraction 

•  of  recombinant  genotypes  and  r  is  the  single  generation 
recombination  fraction  between  a  pair  of  Ioci.°1,22;  Briefly, 
data  were  first  analyzed  for  normality  and  those  traits  for 
which  Fisher’s  cumulant  test  for  normality015  was  signifi¬ 
cant  at  the  5%  level  were  log-transformed  (failure  stress, 
modulus,  body  mass,  and  BMI)  before  further  analysis. 
Transformed  data  were  distributed  normally  by  this  crite¬ 
rion.  Interval  mapping  was  carried  out  to  generate  the  final 
model.02,335  A  permutation  test  using  1000  simulated  data 
sets  was  performed  to  estimate  empirically  significance 
levels. 04,355  Linkage  maps  were  plotted  with  Gnupiot.065 
Linkage  maps  were  generated  for  strain-averaged  data. 

Other  statistical  analysis 

All  values  are  shown  as  the  mean  ±  SD.  Continuous 
variables  were  analyzed  by  analysis  of  variance  (ANOVA) 
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FIG.  1.  Body  masses,  humeral  CSAs,  and  humeral  fs  for  the  HcB / 
Dem  strains.  Each  bar  graph  shows  the  average  ±1  SD  for  C3H/ 
DiSnA,  C57BL/10ScSnA,  and  each  of  the  HcB  strains  tested.  Sample 
sizes  range  between  4  and  10  animals/strain. 


/  test.  Phenotypes  that  were  not  distributed  normally  were 
log-transformed  before  analysis.  In  the  specific  case  of 
BMI,  the  transform  was  log(l  +  BMI).  In  general,  the 
a-level  was  0,05  with  adjustment  for  multiple  comparisons. 
We  used  linear  regression  with  stepwise  addition  and  back¬ 
ward  elimination  to  generate  tire  multivariate  model  for 
failure  load  as  a  function  of  the  other  parameters,  using  an 
initial  inclusion  criterion  of  p  <  0.047  for  adding  parame¬ 
ters  and  a  criterion  of  p  <  0.05  for  their  retention. 


RESULTS 

Size  and  skeletal  morphology 

The  body  masses,  humeral  CSAs,  and  Is  are  shown  in  Fig. 
1.  The  parental  strains  do  not  differ  with  respect  to  body 
mass  or  CS  A,  with  C3H  mice  having  a  body  mass  of  23.2  ± 
1.5  g  and  a  CSA  of  0.53  ±  0.03  mm2  and  B10  mice  having 
a  body  mass  of  25.4  ±  1.4  g  and  a  CSA  of  0.55  ±  0.04 
mm2.  However,  they  differ  markedly  with  respect  to  I 
(0.044  ±  0.005  mm4  for  C3H  and  0.089  ±  0.007  mm4  for 
B10).  Although  only  I  differs  between  the  parental  strains, 
each  of  these  parameters  varies  markedly  among  the  HcB/ 
Dem  strains.  Minimum  and  maximum  values  for  the  various 
parameters  and  the  corresponding  strains  are  shown  in  Ta¬ 
ble  1.  Also  included  in  Table  lNare  summary  data  for  tire 
diaphyseal  diameters  used  to  calculate  CSA  and  /.  The 
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Table  1.  Parental  Phenotypes  and  Phenotypic  Ranges 


Parental s 


All  strains 


Body  mass  (g) 

BMI  (g/cm2) 

Humeral  ML  OD  (mm) 
Humeral  AP  OD  (mm) 
Humeral  ML  ID  (mm) 
Humeral  AP  ID  (mm) 
Humeral  CSA  (mm2) 
Humeral  1  (mm4) 

Femoral  ash  percentage 
Failure  load  (N) 

Structural  stiffness  (N/mm) 
Failure  stress  (MPa) 
Young’s  modulus  (MPa) 


C3H/DiSnA 

(C3H) 

23.2  ±  1.5 
3.270  ±  0.014 
1.06  ±  0.03 
0.83  ±  0.04 
0.53  ±  0.05 
0.39  ±  0.03 
0.53  ±  0.03 
1.044  ±  .005 
69.1  ±  1.1 
7.4  ±  0.3 
17.9  ±  3.7 
173  ±  14 
3570  ±1160 


C57BU 

lOScSnA 

(B10) 

25.4  ±  1.4 
0.295  ±  0.01 
1.33  ±  0.(F 
0.96  ±  0.0^ 
0.89  ±  0.0$ 
0.65  ±  0.0t 
0.55  ±  0.04 
0.089  ±  .001 

64.2  ±  1.9 
7.7  ±  1.4 

21.2  ±  12.S 

87  ±  11 

1130  ±700 


Minimum  value 
and  strain 

8.2  ±  0.5  HcB/13 
>32  ±  0.010  HcB/25 
.02  ±  0.06  HcB/13 
.81  ±  0.05  HcB/8 
.46  ±  0.06  HcB/22 
.31  ±  0.04  HcB/3 
.49  ±  0.06  HcB/8 
)40  ±  .007  HcB/13 

4.2  ±  1.9  B10 

6.1  ±  1.0  HcB/13 

7.2  ±  5.2  HcB/18 
87  ±  11  B10 

30  ±700  B10 


Maximum  value 
and  strain 

27.9  ±  1.3  HcB/6 
311  ±0.009  HcB/3 
1.33  ±0.04  B10 
3.98  ±  0.02  HcB/14 
3.89  ±0.09  B10 
3.65  ±0.04  B10 
3.72  ±  0.05  HcB/14 
089  ±  .007  B10 
70.2  ±  1.5  HcB/23 
10.4  ±  1.2  HcB/14 
10.8  ±  8.5  HcB/14 
192  ±  36  HcB/8 
340  ±  880  HcB/26 


FIG.  2.  Humeri  of  the  parental  strains.  ML  contact  radiographs  of  (A) 
C3H/DiSnA  and  (B)  C57BL/10ScSnA  are  shown. 


distribution  of  values  among  the  HcB/Dem  strains  is  con¬ 
sistent  with  segregation  between  the  parental  strains  con¬ 
tributing  to  each  of  these  phenotypes.  Body  masses  of  the 
HcB/Dem  strains  ranged  between  18.2  ±  0.5  g  (HcB/13) 
and  27.9  ±  1.3  g  (HcB/6).  Most  HcB/Dem  strains  had  larger 
CSAs  than  the  parental  strains,  ranging  from  a  minimum  of 
0.49  ±  0.06  mm2  (HcB/8)  to  a  maximum  of  0.72  ±  0.05 
mm2  (HcB/14). 

The  magnitude  of  the  differences  in  CSA  and  I  is  shown 
in  Fig.  2.  Although  CSA  is  equal  between  the  parental 
strains,  the  source  of  their  marked  difference  in  /is  apparent 
rom  this  figure,  which  shows  the  striking  dissimilarity  of 
their  diaphyseal  diameters.  Thus,  although  C3H  mice  have 
B  n1Ck  cortex  and  a  small  diaphyseal  diameter, 

mice  have  the  converse  phenotype.  This  is  reflected  in 
’  w“lc*t  is  proportional  to  the  product  of  CSA  and  the 
^le  ^'Stance  °f  that  CSA  from  the  bending  axis  for 
the  h  /r,°'nt  kending  test.  The  divergence  of  CSA  among 
cB/Dem  strains  arises  as  a  result  of  dissociation  of 


diaphyseal  diameter  from  cortical  bone  thickness.  HcB/14 
has  both  a  thick  diaphyseal  cortex  and  a  large  overall 
diameter,  while  HcB/8  is  characterized  by  a  thin  cortex  and 
a  small  overall  diameter. 


Biomechanical  performance  and  ash  analysis 

We  performed  quasi-static  three-point  bending  tests  of 
the  left  humeri  for  each  animal.  In  this  test,  the  fracture  is 
initiated  in  the  plane  of  contact  between  the  specimen  and 
the  central  post  of  the  apparatus.  This  protocol  allows  the 
central  post  to  be  placed  at  a  recognizable  anatomical  site 
and  measurements  of  the  bone  can  be  made  at  the  same  site. 
Results  of  these  studies  are  summarized  in  Table  1  and  Fig. 
3.  Failure  load  is  one  of  the  most  basic  measures  of  struc¬ 
tural  strength — the  force  needed  to  fracture  the  bone.  The 
parental  strains  do  not  differ  in  this  parameter,  with  C3H 
mice  having  a  failure  load  of  7.4  ±  0.3  N  and  B10  mice 
having  a  failure  load  of  7.7  ±  1.4  N.  Nevertheless,  a  wide 
range  of  failure  loads  was  observed  over  the  HcB/Dem 
strains,  from  HcB/13’s  value  of  6.1  ±  1.0  N  to  HcB/14’s 
value  of  10.4  ±  1.2  N.  A  similar  pattern  is  seen  for  struc¬ 
tural  stiffness  as  well,  with  C3H  having  a  value  of  17.9  ± 
3.7  N/mm  and  B10  having  a  value  of  21.2  ±  12.9  N/mm. 
The  most  divergent  strains  are  HcB/18  (17.2  ±  5.2  N/mm) 
and  HcB/14  (40.8  ±  8.5  N/mm). 

It  is  useful  to  distinguish  between  structural  and  material 
properties  in  considering  biomechanical  testing  data.  Struc¬ 
tural  properties  characterize  the  specimen  as  a  whole,  en¬ 
compassing  both  its  anatomy  and  its  material.  The  structural 
properties  are  measured  directly  during  the  performance  of 
the  test.  Material  properties,  in  contrast,  are  calculated  from 
the  measured  structural  properties  and  the  measured  ana¬ 
tomical  parameters.  Failure  stress  is  the  material  analogue  to 
failure  load  and  corrects  for  a  specimen’s  CSA  and  diameter 
at  the  break  point.  In  contrast  to  the  data  for  failure  load,  the 
parental  strains  have  widely  divergent  failure  stresses, 
173  ±  14  MPa  for  C3H  and  87  ±  11  MPa  fot  B10.  BlO’s 
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FlGi  3.  Biomechanical  performance  of  the  HcB/Dem  strains.  Each 
bar  graph  shows  the  average  ±1  SD  for  C3H/DiSnA,  C57BL/ 
lOScSnA,  and  each  of  the  HcB  strains  tested.  Sample  sizes  range 
between  4  and  10  animals/strain. 


failure  stress  is  the  minimum  for  the  strains  studied 'and 
HcB/8’s  value  of  192  ±  36  MPa  is  the  maximum.  Young’s 
modulus  is  the  material  measure  of  stiffness;  the  parental 
strains’  difference  in  failure  stress  is  reflected  by  their 
moduli  as  well,  with  C3H  having  a  modulus  of  3570  ±  1160 
MPa- and  BIO  having  a  modulus  of  1130  ±  700  MPa.  B10 
has  the  least  stiff  bone  tissue  observed  in  this  study,  whereas 
HcB/26,  with  a  modulus  of  5340  ±  880  MPa,  has  the 
stiffest  bone  tissue  observed.  These  data  show  that  both 
structural  aftd  material  strength  and  stiffness  vary  widely 
among  the  HcB/Dem  strains. 

As  a  first  step  in  characterizing  the  contribution  of  mate¬ 
rial  properties  to  overall  biomechanical  performance,  we 
performed  ash  analysis  of  femora  from  the  same  animals 
used  for  biomechanical  testing.  Ash  percentage  reflects  the 
mineral  content  of  the  bone  tissue  and  is  the  parameter  most 
closely  related  to  clinical  measurements  of  BMD  or  bone 
mineral  content  (BMC).  The  results  are  summarized  in 
Table  1  arid  Fig.  3  and  show  that  C3H,  with  a  value  of 
69.1  ±  1.1%,  and  BIO,  with  a  value  of  64.2  ±  1.9%,  differ 
markedly  for  this  parameter.  BIO  has  the  minimum  ash 


percentage  of  the  strains  studied  and  HcB/23,  with  a  value 
of  70.2  ±  1.5%,  has  the  maximum  ash  percentage.  Thus, 
substantial  variations  exist  in  the  mineral  content  of  bones 
from  the  HcB/Dem  strains. 

Contributions  of  the  measured  parameters  to  bone 
strength 

Data  presented  in  the  previous  section  show  that  marked 
variations  in  animal  size,  anatomy  as  reflected  by  diaphyseal 
diameter  and  I,  and  BMC  exist  among  the  HcB/Dem  strains 
and  each  of  these  are  expected  to  contribute  to  differences  in 
biomechanical  performance  among  them.  As  an  initial  step  in 
understanding  the  contribution  of  each  of  these  parameters  to 
biomechanical  performance,  we  performed  a  simple  correla¬ 
tion  analysis  of  each  of  the  parameters  in  a  pairwise  fashion, 
after  log-transforming  data  for  parameters  with  skewed  distri¬ 
butions.  Results  of  this  analysis  are  summarized  in  Table  2. 

We  used  these  data  to  guide  successive  addition/elimination 
and  elimination  only  stepwise  regression  analyses  of  failure 
load  as  a  linear  function  of  the  other  traits  (Table  3).  Such 
models  express  a  single  variable,  in  this  case  failure  load,  as  a 
linear  function  of  a  subset  of  the  oilier  measured  variables  and 
an  intercept  given  by  the  constant  term  of  file  equation.  These 
other  variables  are  included  in  the  model  only  if  they  add  to  the 
model’s  explanatory  power,  as  reflected  by  an  increase  in  R2. 
A  perfect  model  has  an  R2  —  1.0  and  the  Rz  value  gives  the 
fraction  of  the  variation  in  failure  load  that  is  attributable  to 
variation  in  the  other  traits. 

In  the  first  model,  we  used  all  other  measured  parameters 
as  possible  independent  variables.  This  model  found  that 
over  90%  of  the  failure  load  could  be  accounted  for  as  a 
function  of  log(failure  stress),  CSA,  and  I.  The  model’s  R2 
=  0.906  and  its  equation  is 

failure  load  =  —39.83  +  18.63  logffailure  stress) 

+  5.93  CSA  +  67.77 1. 

This  model,  although  accounting  for  over  90%  of  the  failure 
stress,  is  limited  by  the  fact  that  failure  stress  is  a  function 
of  load  and  7. 

In  the  second  model,  we  allowed  only  variables  that  were 
measured  directly  rather  than  calculated  from  measured 
quantities.  This  model  had  poorer  predictive  ability,  ac¬ 
counting  for  nearly  57%  of  failure  load.  Its  R2  =  0.568  and 
its  equation  is  given  by 

failure  load  =  -13.48  +  5.51  log(structural  stiffness) 

+  0.11  ash%  +  4.19  log(body  mass)  +  1.97  AP  ID. 

The  majority  of  the  explanatory  power  of  this  model  came 
from  the  structural  stiffness,  which  by  itself  explained  50% 
of  the  failure  load.  In  both  models,  the  same  results  were 
obtained  by  stepwise  addition/elimination  and  elimination- 
only  analyses. 

Mapping  of  loci  contributing  to  the  phenotypes 

Previously  published  genotypic  data  were  used  with  the 
phenotypic  analysis  described  previously  to  map  loci  con- 
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Table  2.  Correlations  Among  Bone  Parameters 


parameters 

Structure 

stiffness 

Stress 

Modulus 

Ash  % 

I 

CSA 

MLOD 

ML  IP 

APOD 

APID 

Body 

moss 

BMI 

Load 

0.714 

0.236 

0.272 

0.505 

0.707 

0.726 

0.655 

0.275 

0.442 

0.294 

0.454 

0.236 

<0.001 

0.002 

<0.001 

0.014 

<0.001 

<0.001 

<0.001 

<0.001 

<0.001 

<0.001 

<0.001 

0.002 

Structural 

— 

NS 

0.581 

0.168 

0.461 

0.484 

0.456 

0.227 

0.437 

0.293 

0.363 

0.171 

stiffness 

<0.001 

0.03 

<0.001 

<0.001 

<0.001 

0.003 

<0.001 

<0.001 

<0.001 

0.023 

— 

0.685 

NS 

-0.674 

-0.641 

-0.636 

-0.331 

-0.523 

-0.303 

-0.153 

NS 

<0.001 

<0.001 

<0.001 

<0.001 

<0.001 

<0.001 

<0.001 

0.046 

Modulus 

NS 

-0.388 

-0.328 

-0.412 

-0.272 

-0.211 

NS 

NS 

NS 

<0.001 

<0.001 

<0.001 

<0.001 

0.006 

-  - 

NS 

-0.449 

NS' 

NS  ' 

0.157 

NS 

0.269 

NS 

0.032 

0.044 

0.001 

/ 

— 

0.893 

0.959 

0.566 

0.716 

0.462 

0.461 

0.192 

<0.001 

<0.001 

<0.001 

<0.001 

<0.001 

<0.001 

0.012 

CSA 

— 

0.787 

0.228 

0.798 

0.277 

0.507 

0.234 

<0.001 

<0.001 

<0.001 

<0.001 

<0.001 

0.002 

ML  OD 

— 

0.649 

0.533 

0.372 

0.454 

0.222 

<0.001 

<0.001 

<0.001 

<0.001 

0.003 

ML  ID 

— 

0.289 

0.267 

NS 

NS 

<0.001 

<0.001 

APOD 

* 

— 

0.666 

0.308 

NS 

<0.001 

<0.001 

APID 

— 

NS 

NS 

Body  mass 

— 

0.786 

- 

<0.001 

Parameters  were  log-transformed  before  analysis.  Significant  correlation  coefficients  are  shown  on  the  top  line  and  their  associated  p 
values  are  shown  on  the  second  line  of  each  cell. 


Table  3.  Stepwise  Regression  Models  of  Failure  Load 


Step.  , 

-  Parameter ' 

P 

Coefficient* 

AR2 

•  *  *•  1  \  • 

Model  1.  All  parameters  included  as  potential  independent  variables 

■  .1-  • 

Log(structural  stiffness) 

<0.001 

0.432 

2  -  ‘ 

Mlod 

<0.001 

0.0519 

3  "  ” 

Log(stress) 

<0.001 

18.626 

0.302 

4 

CSA 

<0.001 

5.927 

0.113 

5  . 

I 

<0.001 

67.774 

0.011 

6 

Remove  log(structural  stiffness) 

0.126 

-0.001 

7 

Remove  ML  OD 

0.059 

-0.002 

1 

Model  2.  Only  directly  measured  parameters 
Logfstmctural  stiffness) 

included  as  potential  independent  variables 
<0.001  5.514 

0.503 

2 

Logfbody  mass) 

<0.001 

4.192 

0.040 

3 

.  Ash% 

0.025 

0.111 

0.013 

4 

APID 

0.035 

1.973 

0.012 

a  The  coefficient 

is  included  only  for  those  parameters  retained  in  the  final  model. 

l  stiffness) 
1.97  AP  ID. 
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tributing  to  each  of  the  studied  traits.  The  mapping  data  for 
selected  chromosomes  are  summarized  in  Fig.  4.  Mapping 
data  for  other  traits  (data  not  shown)  reveal  that  LOD  score 
graphs  for  all  the  diaphyseal  diameters  closely  parallel  each 
other.  This  also  is  true  of  CSA,  I,  failure  stress,  and  mod¬ 
ulus,  which  are  calculated  using  the  diaphyseal  diameters. 
Because  these  parameters  are  codependent,  only  I  is  in¬ 
cluded  in  Fig.  4  along  with  the  independently  measured 
baits  of  failure  load,  structural  stiffness,  ash  percentage,  and 
Body  mass. 

We  performed  a  1000-iteration  permutation  test  to  esti¬ 
mate  experiment-wide  significance  levels  for  each  trait  as 


summarized  in  Table  4.  As  shown  in  Fig.  4,  none  of  the 
strain-averaged  QTL  peaks  achieved  experiment- wide  sta¬ 
tistical  significance, <37,38)  regions  on  multiple  chromosomes 
had  LOD  scores  in  excess  of  1.7.  An  LOD  a  1  threshold  is 
commonly  used  for  exploratory  genome  scans  of  complex 
traits  in  a  staged  searching  strategy/375  However,  given  that 
we  performed  linkage  mapping  for  five  independently  mea¬ 
sured  traits,  it  is  appropriate  to  raise  the  exploratory  thresh¬ 
old  to  the  Bonferroni-corrected  value  of  LOD  5:  1.7. 

On  several  chromosomes,  LOD  peaks  for  different  traits 
map  to  the  same  genomic  locations.  Overlapping  mapping 
assignments  are  consistent  with  the  existence  of  pleiotropic 
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Chromosome  1  Chromosome  2 


Chromosome  8  Chromosome  10 


Chromosome  11  Chromosome  12 


Q 


FIG.  4.  Linkage  maps  for  five  traits.  Linkage  maps  for  failure  load  (red  diamonds),  structural  stiffness  (green  +  signs),  ash  percentage  (blue 
squares),  I  (magenta  X’s),  and  body  mass  (navy  triangles)  are  shown  for  each  autosome  in  which  the  maximum  LOD  for  any  trait  exceeded  1.0. 
The  X  dimension  measures  position  in  morgans  (1  M  =  100  cM)  from  the  most  centromeric  marker.  No  “tails”  flanking  the  outermost  markers 
are  included  in  the  maps.  The  Y  dimension  measures  LOD  score.  Marker  positions  are  indicated  below  the  x  axis. 


r  ET  AL. 


PHENOTYPIC  ANALYSIS  OF  HcB/Dem  STRAINS 


999 


itage  (blue 
seeded  1.0. 
ist  markers 


Table  4.  Experimental- Wide  Significance  Levels 


Significance  level 

Load 

Structural  stiffness 

Ash% 

I 

Mass 

0.01 

3.45 

3.22 

2.96 

3.47 

3.52 

3.03 

2.77 

3.22 

3.17 

0.05 

2.71 

2.79 

2.55 

2.92 

2.88 

0.10 

2.45 

2.44 

2.41 

2.62 

2.59 

(affecting  multiple  traits)  QTLs  at  these  locations.  The 
degree  of  overlap  among  the  mapping  assignments  is  vari¬ 
able.  Failure  load  and  structural  stiffness  linkage  maps  are 
nearly  parallel  throughout  the  genome.  A  lesser  degree  of 
overlap  is  seen  between  failure  load  and  7,  and  even  less  is 
observed  between  failure  load  and  ash  percentage.  The 
linkage  data  suggest  that  there  are  distinct  but  overlapping 
sets  of  QTLs  that  contribute  to  each  of  the  bone  strength- 
related  traits  shown  in  the  HcB/Dem  system.  This  is  well 
illustrated  in  comparing  the  maps  of  chromosomes  1  and  10 
(Fig.  4).  On  chromosome  1,  a  peak  centered  at  DlMitlO 
includes  failure  load,  structural  stiffness,  7,  and  body  mass, 
but  not  ash  percentage.  Conversely,  the  peak  on  chromo¬ 
some  10  centered  on  D10Mit3  includes  failure  load,  struc¬ 
tural  stiffness,  and-  ash  percentage  but  not  7  or  body  mass. 

DISCUSSION 

This  report  presents  bone  phenotypes  for  24  of  the  27 
'  HcB/Dem  RC  strains.  These  strains  span  -nearly  a  2-fold 
..  .  .range  of  failure  loads  and  vary  widely  for  multiple  related 
-  "i  traits.  I^hmy  phenotype  pairs,  even  excluding  those  that  are 
: .  • .  •triviaH.y'related,  display  significant  correlations.  A  multivar- 
-  .,i.iate  linear  regression  model  including  only  traits  that  were 
measured  directly  accounts  for  more  than  half  the  difference 
in  failure  load  among  the  strains.  The  phenotypic  data  were 
used  to  generate  the  QTL  maps  for  five  independently 
measured  traits  in  the  HcB/Dem  strains  shown  in  Fig.  4. 

The  linkage  maps  reveal  that  potential  QTLs  for  distinct 
traits  sometimes  colocalize  to  a  single  position  in  the  ge-. 
nome,  which  we  interpret  as  the  presence  of  a  single  gene 
with  pleiotropic  effects  (affecting  multiple  traits)  at  each 
multitrait  LOD  peak.  An  alternative  explanation  for  colo¬ 
calization  is  that  distinct  but  closely  finked  QTLs  are  re¬ 
sponsible  for  each  of  the  phenotypes.  Although  Hie  pleio- 
tropy  and  multiple  linked  gene  models  cannot  be 
distinguished  by  our  data,  the  pleiotropy  model  is  both  more 
economical  and  consistent  with  present  understanding  of 
bone  strength.  Our  data  reveal  coincident  LOD  peaks  for 
distinct  subsets  of  the  traits  at  several  positions  in  the 
genome.  There  is  prior  evidence  relating  these  traits  to 
failure  load/26,28,39-445  The  existence  of  pleiotropic  QTLs 
would  provide  a  biological  basis  for  the  observed  clinical 
correlations  between  body  mass  and  bone  strength.  The 
biological  importance  of  these  putative  pleiotropic  loci  is 
reinforced  by  the  fact  that  they  are  based  on  very  different, 
independently  measured  phenotypes.  Mechanistically, 
pleiotropy  most  likely  reflects  the  many  developmental  and 
biochemical  steps  separating  the  products  of  the  putative 
pleiotropic  loci  and  the  measured  phenotypes.  If  one  accepts 


the  pleiotropy  interpretation,  it  also  is  worth  noting  that 
colocalization  of  LOD  peaks  for  multiple  independent  traits 
mitigates  lack  of  statistical  significance  for  linkage  assign¬ 
ments.  Permutation  tests  determine  the  frequency  with 
which  peaks  of  a  given  LOD  threshold  will  occur  anywhere 
in  the  genome  by  chance  alone.  Because  there  are  many 
genomic  locations  where  artifactual  LOD  peaks  may  occur, 
coincidence  of  peaks  for  multiple,  independently  measured 
traits  can  be  interpreted  more  plausibly  as  representing 
biology  rather  than  statistical  accident.  On  this  basis,  we 
would  predict  that  further  experiments  in  this  system  are 
more  likely  to  find  significant  QTLs  on  chromosomes  1, 10, 
and  11  than  on  chromosomes  2,  8,  and  12. 

In  die  linkage  analysis,  LOD  scores  may  be  increased 
because  of  epistasis  between  specific  locus  pairs.  In  previ¬ 
ous  work,  Demant  and  colleagues  showed  large  epistatic 
interactions  between  tumor  susceptibility  loci.(45l46)  The 
small  number  of  genotypes  studied  here  will  tend  to  mag¬ 
nify  the  contributions  of  individual  loci,  because  main  ef¬ 
fects  cannot  be  distinguished  from  interactions.  These  lim¬ 
itations  of  the  data  presented  here  can  best  be  addressed  by 
performing  additional  crosses;  these  ar$  in  progress. 

Genetic  differences  are.  believed  to  account  for  a  substan¬ 
tial  portion  of  the  variability  in  the  bone  strength  of  both 
mice  and  humans.  In  mice,  Beamer  and  colleagues  have 
measured  volumetric  BMD  and  cortical  thickness  by  quan¬ 
titative  computed  tomography  (CT)  scanning  in  a  panel  of 
11  inbred  mouse  strains,  noting  that  C57BL/6J  and  C3H/ 
HeJ  are  the  most  extreme  strains  in  their  sample  for  these 
parameters/235  These  strains  are  related  closely  to  the  pro¬ 
genitors  of  the  HcB/Dem  RC  strains.  These  investigators 
have  begun  to  map  volumetric  BMD  in  crosses  between 
C57BL/6J  and  CAST/ED(47)  and  between  C57BL/6J  and 
C3H/HeJ.t48)  In  these  crosses,  a  highly  significant  QTL 
centered  at  DIMitlS  falls  in  a  region  that  may  overlap  with 
our  LOD  peak  centered  at  DlMitlO.  Two  groups  have 
studied  the  genetic  basis  of  peak  bone  mass  in  the  SAM/P6 
senescence-accelerated,  osteoporotic  mouse.  Shimizu  and 
colleagues'495  used  a  midfemoral  cortical  thickness  index  as 
a  measure  of  bone  mass  in  a  cross  between  SAMP6  and 
S  AMP2,  reporting  a  highly  significant  QTL  on  chromosome 
1 1  between  DllMit90  and  D11MU59  that  may  overlap  our 
peak  on  that  chromosome.  Benes  et  al.  have  performed 
linkage  mapping  of  areal  BMD  in  crosses  of  SAMP6  with 
SAMR1  and  AKR/J.(S0>  These  workers  also  found  a  signif¬ 
icant  QTL  in  the  corresponding  chromosome  11  region. 
Klein  and  colleagues  analyzed  areal  (i.e.,  projected)  BMD 
in  the  B  X  D  recombinant  inbred  strains  as  measured  by 
dual-energy  X-ray  absorptiometry. (51)  Tins  study,  like  ours, 
was  exploratory  in  nature  because  of  the  limited  number  of 
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genotypes  examined.  These  related  investigations  have 
identified  genomic  regions  that  overlap  partially  with  our 
results,  providing  independent  evidence  that  LOD  score 
peaks  noted  in  the  HcB/Dem  system  reflect  true  QTLs. 

It  is  important  to  note  two  important  differences  between 
our  experiments  and  the  work  performed  by  the  other 
groups.  First,  although  the  phenotypes  investigated  are  re¬ 
lated,  they  are  not  the  same  and  therefore  may  be  under 
somewhat  different  genetic  control.  Second,  the  progenitors 
in  each  investigation  are  different.  Consequently,  in  each 
system,  a  different  set  of  loci  and  alleles  segregates.  These 
caveats  are  particularly  important  in  relating  the  data  pre¬ 
sented  here  with  other  mapping  studies.  Colocalization  of 
putative  QTLs  from  different  experiments,  although  provid¬ 
ing  evidence  that  a  relevant  gene  is  present,  still  requires 
caution  if  the  phenotypic  measures  differ,  as  is  true  for 
chromosomes  1  and  11.  Conversely,  it  is  not  surprising  that 
some  distinct  loci  contribute  to  bone  strength  in  each  ex¬ 
periment,  given  the  diversity  of  traits  and  strains  studied. 

A  human  locus  controlling  bone  mass  ( HBM)  has  been 
mapped  to  chromosome  11  ql2-13(525  and  there  is  evidence 
that  this  locus  may  account  for  variation  in  bone  mass  in  a 
broader-based  population  as  well.<53)  The  murine  homo- 
logue  to  the  human  HBM  locus  is  predicted  to  map  to  near 
the  centromere  of  chromosome  19. 

Thus  far,  mapping  studies  of  bone  properties  have  been 
heavily  weighted  toward  radiographically  determined  mineral 
content  as  the  phenotype.  There  are  good  reasons  for  this,  most 
notably  the  existence  of  several  well-characterized  precise 
methods  for  measuring  this  parameter.  However,  as  illustrated 
by  the  data  presented  here  and  consistent  with  clinical  experi¬ 
ence  in  humans,  mineral  content  is  only  one  among  several 
.'•factors  contributing  to  overall  bone  strength  and  fracture 
risk/3'40'43,545  Our  observations  also  suggest  that  material  prop¬ 
erties  of  bone  that  were  not  investigated  contribute  to  differ¬ 
ences  of  failure  stress  and  Young’s  modulus  as  well  as  failure 
load.  These  might  include  crystal  size  and  morphology,  degree 
of  crystal  perfection,  degree  of  substitution  of  carbonate  in  the 
mineral,  degree  or  pattern  of  collagen  cross-linking,  and  dif¬ 
ferences  in  noncollagenous  proteins.  In  this  study,  we  have 
exploited  the  genetic  homogeneity  of  inbred  mice  to  allow 
biomechanical  data  to  be  used  as  a  phenotype  in  a  preliminary 
mapping  study.  The  HcB/Dem  RC  strains  each  contain  a 
distinct  complement  of  BIO  alleles  on  a  background  of  the 
C3H  genome.  Both  progenitors  are  “wild  type,”  lacking  mu¬ 
tations  affecting  skeletal  structure,  function,  or  development  in 
an  obvious  fashion.  Yet,  the  cumulative  effects  of  allelic  dif¬ 
ferences  between  these  strains  lead  to  quite  dramatic  differ¬ 
ences  in  the  cortical  bone  properties  of  adult  mice. 

Recently,  increased  attention  has  been  focused  on  how 
anatomy  affects  bone  strength  in  humans.  Myers  and  col¬ 
leagues'555  related  load  to  fracture  for  cadaveric  human 
forearms  to  BMD,  BMC,  CSA,  and  I.  They  found  that  both 
CSA  and  /  were  more  predictive  of  biomechanical  perfor¬ 
mance  than  either  BMD  or  BMC  measured  by  dual-energy 
X-ray  absorptiometry,  consistent  with  our  phenotypic  data. 
Several  groups  have  suggested  that  a  longer  hip  axis  length 
(HAL)  increases  the  risk  of  femoral  neck  fractures/56-  665 
Bell  and  colleagues  have  reported  cross-sectional  data  sug¬ 
gesting  that  loss  of  cortical  bone  mass  in  the  anteroinferior- 
posterosuperior  axis  of  the  femoral  neck  increases  the  risk 


of  hip  fracture/675  They  hypothesize  that  formation  of  “gi¬ 
ant”  Haversian  canals  in  the  femoral  neck  contributes  to  the 
specific  loss  of  cortical  bone  mass/685  Duan  and  colleagues 
assessed  the  contributions  of  reduced  bone  size  and  reduced 
volumetric  BMD  to  vertebral  fracture  risk/695  These  inves¬ 
tigators  found  in  a  cross-sectional  sample  that  both  contrib¬ 
uted  to  fracture  risk  and  that  reduced  vertebral  body  size 
accounted  for  16%  of  the  areal  BMD  deficit  of  fracture 
patients  relative  to  controls.  This  bone  size  effect  accounts 
for  the  apparent  discrepancy  between  the  work  of  Bepaer 
and  colleagues,  who  find  C57BL/6J  to  be  a  low  BMD  strain, 
and  that  of  Klein  and  colleagues,  who  find  C57BL/6J  to  be 
the  high  BMD  progenitor  in  the  B  X  D  recombinant  inbred 
system.  The  Beamer  group  follows  volumetric  BMD 
whereas  the  Klein  group  follows  areal  BMD.  Like  C57BL/ 
lOScSnA,  the  progenitor  of  the  HcB/Dem  RC  series, 
C57BL/6J  is  characterized  by  a  large  diaphyseal  diameter, (70) 
resulting  in  a  bone  volume-dependent  increase  in  areal  BMD. 
As  in  different  strains  of  mice,  humans  display  ethnic  differ¬ 
ences  in  bone  volume'10,11,61'71-745  and  bone  size  is  a  highly 
heritable  trait15,6-9,12-15,755  Areal  BMD’s  better  performance 
in  predicting  fracture  risk  compared  with  volumetric  BMD 
thus  is  seen  to  be  the  result  of  its  inherently  compound 
nature — it  includes  a  measure  of  bone  mineralization  and  su¬ 
perimposes  an  anatomic  factor  that  reflects  /.(73,76-81)  Mapping 
QTLs  that  allow  resolution  of  the  anatomic  and  material  as¬ 
pects  of  bone  strength  therefore  is  a  notable  step  toward  un¬ 
raveling  the  complexities  of  fracture  risk. 

Work  to  date  has  made  it  apparent  that  family  relation¬ 
ships  among  genes  have  been  conserved  over  evolution. 
One  practical  result  of  these  findings  is  that  genes’  functions 
are  similar  in  humans  to  their  functions  in  model  organisms. 
Moreover,  we  have  learned  that  .the  organization  of  chro¬ 
mosome  structure  in  these  two  species  is  highly  conserved, 
with  the  preservation  of  so-called  syntenies,  or  groups  of 
physically  linked  genes  that  have  remained  together  over 
the  course  of  evolution.  Only  about  200  major  chromosome 
rearrangements  are  thought  to  have  occurred  since  the  hu¬ 
man  and  murine  lineages  diverged/82-  855  Therefore,  a  sec¬ 
ond  practical  consequence  of  the  genome  project’s  progress 
is  that  genetic  mapping  data  from  the  mouse  can  be  used  to 
predict  the  locations  of  corresponding  human  genes.  The 
existence  of  conserved  synteny  relationships  will  allow  test¬ 
ing  of  the  roles  of  genetic  loci  identified  in  the  mouse  in 
human  populations. 

The  data  presented  previously  are  limited  in  several  im¬ 
portant  ways.  First,  sample  sizes  are  relatively  small  and  are 
limited  to  6-month-old  female  mice.  Beamer  and  colleagues 
have  shown  that  bone  mass  in  mice  does  not  remain  con¬ 
stant  throughout  adulthood  and  that  change  in  BMC  follows 
different  time  courses  in  different  strains/235  Loss  of  bone 
mass  during  aging  also  has  been  established  by  the  SAMP6 
mouse,  which  develops  osteoporosis  as  it  ages/865  Our  data 
do  not  address  any  aspect  of  bone  turnover  over  an  individ¬ 
ual’ s  lifetime. 

Second,  there  are  important  differences  in  bone  metabo¬ 
lism  between  mice  and  humans.  The  mouse  skeleton  under¬ 
goes  virtually  no  osteonal  remodeling,' 1875  whereas  such 
remodeling  is  a  central  feature  of  human  bone.  Although 
mice  grow  primarily  during  the  first  4  months  of  life,  their 
epiphyses  remain  open  throughout  their  lives.  The  murine 
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estrous  cycle  is  quite  dissimilar  to  human  menses  and  mice 
do  not  undergo  menopause  in  midlife.  Each  of  these  differ- 
'  ences  limits  the  applicability  of  the  findings  reported  here  to 
human  bone  properties. 

Third,  important  limitations  accompany  the  choice  of  the 
HcB/Dem  system.  Only  24  strains  were  examined,  limiting 
the  power  of  the  linkage  study  to  an  exploratory  level. 
Mapping  was  performed  with  strain-averaged  data,  obscur¬ 
ing  intrastrain  variability  and  weighting  the  individuals 
from  the  strains  with  the  smallest  sample  sizes  most  heavily. 
Genotypic  data  for  the  HcB/Dem  strains  only  include  ap¬ 
proximately  130  markers  and  their  genotypes  include  some 
relatively  large  untyped  segments. (19,20)  The  RC  breeding 
scheme  aggravates  the  impact  of  gaps  in  the  HcB/Dem 
strain  distribution  pattern.  There  are  multiple  opportunities 
for  crossing  over  to  occur  during  the  inbreeding  process,  so 
that  the  extent  of  the  genome  spanned  by  each  marker  in  RC 
lines  is  only  approximately  one-fourth  that  spanned  by 
markers  in  a  single-generation  experiment/21,22,885  There 
are  likely  to  be  additional  small,  unidentified  differential 
segments  between  the  two  parental  strains.  Moreover,  there 
may  be  segregating  QTLs  that  contribute  to  the  phenotypes 
examined  that  this  experiment  was  unable  to  detect.  Incor¬ 
poration  of  additional  markers  to  the  HcB/Dem  strain  dis¬ 
tribution  pattern  will  allow  this  issue  to  be  addressed. 

Fourth,  although  biomechanical  tests  are  more  closely 
related  to  fracture  risk  than  is  BMC,  the  fractures  generated 
in  these  tests  are  still  artifactual  with  regard  to  fracture  site 
and  fracture  mechanism.  .Three-point  bending  of  the  mouse 
humerus  assesses  cortical  bone  strength,  because  the  site  is 
(  virtually  devoid  of  trabecular  bone.  The  stresses  that  lead  to 
Mpman^fragility  fractures  and  hip  fractures  in  particular 
iv  •'.'prdhhh^.  differ  from  the  fractures  generated  in  our  biome- 
v/chanical  tests  in  important  ways. 

Fifth,'  not  all 'the  phenotypes  studied  are  of  equivalent 
status.  Failure  load,  structural  stiffness,  body  mass,  BMI, 
ash  percentage,  and  the  various  diameters  are  measured 
directly.  Failure  stress,  modulus,  CSA,  and  I,  in  contrast,  are 
calculated  from  the  directly  measured  phenotypes.  Al¬ 
though  all  measurements  are  subject  to  potential  errors,  only- 
the  calculated  parameters  are  subject  to  compounded  errors. 

Sixth,  the  LOD  scores  of  potential  QTLs  are  systemati¬ 
cally  overestimated  as  discussed  previously,  primarily  as  a 
consequence  of  the  small  number  of  independent  genotypes 
examined.  This  means  that  QTL  mapping  assignments 
based  on  either  recombinant  inbred  or  RC  strain  data  must 
be  confirmed  in  an  independent  breeding  experiment.  This 
limitation  applies  equally  to  the  data  presented  here  and 
those  presented  by  Klein  and  coworkers/515 

These  limitations  notwithstanding,  the  experiments  re¬ 
ported  here  are  a  comprehensive  analysis  of  the  biome¬ 
chanical  properties  of  humeri  and  of  their  structural,  mate¬ 
rial  underpinnings  and  a  preliminary  investigation  of  their 
genetic  basis  in  the  HcB/Dem  RC  system.  The  virtue  of 
combining  QTL  mapping  with  comprehensive  phenotypic 
analysis  is  that  insight  is  gained  not  only  regarding  the 
chromosomal  locations  of  the  genes  contributing  to  bone 
strength,  but  also  regarding  the  mechanisms  by  which 
strength  is  achieved. 
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Perspective 

Breaking  Down  Bone  Strength: 

A  Perspective  on  the  Future  of  Skeletal  Genetics 


ROBERT  D.  BLANK 


’  INTRODUCTION 


W  P™  increasing  intensity  over  the  past  decade,  re- 
VV  searchers  have  pursued  genetic  approaches  to  under¬ 
standing  the  basis  of  skeletal  fragility.  This  month’s  issue  of 
the  JBMR  includes  a  report  by  Beamer  and  associates'-15  that 
identifies  quantitative^  trait  loci  (QTLs)  for  femoral  and 
vertebral  volumetric  bone  mineral  density  (vBMD)  and 
presents  us  with  an  opportunity  to  reflect  on  recent  progress 
and' future  directions  in  bone  genetics. 


’.  CHOICE  OF  PHENOTYPES 


■  BMD  is  only  one  of  several  parameters  related  to  biome¬ 
chanical  performance.  Fracture,  not  BMD,  is  the  clinically 
relevant  outcome;  ideally,  scientists  would  perform  genetic 
analyses  with  fracture  as  an  endpoint  rather  than  on  surro¬ 
gate  traits  that  correlate  imperfectly  with  this  abnormality. 
Deng  and  colleagues'21  estimated  that  the  narrow-sense  her- 
itability  of  Colles’  fracture  was  approximately  0.25  in  a 
cohort  of  white  American  women,  thus  accounting  for  ap¬ 
proximately  one-quarter  of  total  Colles’  fracture  risk  varia¬ 
tion  observed.  This  indicates  that  the  magnitude  of  the 
genetic  contribution  to  fracture  warrants  further  study,  but 
important  caveats  exist.  Fractures  are  relatively  rare  and 
tend  to  occur  late  in  life.  Should  the  genetic  basis  of  human 
fracture  prove  to  be  quantitative,  then  the  contributions  of 
individual  loci  to  the  total  fracture  risk  are  likely  too  small 
to  allow  successful  mapping  of  individual  QTLs  with  real¬ 
istic  sample  sizes.  These  considerations  impose  important 
and  possibly  insurmountable  obstacles  to  recruitment  of  an 
adequate  population  in  which  to  perform  mapping  studies  of 
Colles’  fracture  in  humans.  In  addition,  because  Colles’ 


fractures  are  frequent  relative  to  hip  fractures,  collecting  an 
adequate  sample  to  perform  equivalent  analyses  of  hip 
fracture  may  prove  a  daunting  challenge.  Further,  vertebral 
fractures,  although  relatively  common,  do  not  always  come 
to  clinical  attention'3,41  and  their  diagnosis  may  prove 
uncertain,'5'91  posing  potential  problems  in  the  study  of 
vertebral  fracture  risk. 

Investigators  have  adopted  alternative  strategies  using 
surrogate  endpoints  to  overcome  the  practical  limitations  of 
studying  human  fracture  directly.  One  approach  uses  bio¬ 
mechanical  performance  in  a  contrived  experimental 
scheme  as  an  analog  for  natural  fracture;  this  allows  func¬ 
tional  assessment  of  the  bone,  but  the  mechanisms  in  bio¬ 
mechanical  testing  differ  from  those  of  naturally  occurring 
fractures.  In  addition,  cadaver  specimens  and  specimens 
obtained  from  model  organisms,  both  used  frequently  in 
these  investigations,  impose  additional  differences  from  in 
vivo  human  fractures.  A  second  approach,  based  on  epide¬ 
miological  data,  focuses  on  predictors  of  fracture  rather  than 
analogs  of  fracture  and  is  reflected  by  the  vast  literature  on 
densitometric  assessment  of  bone.  The  success  of  these 
methods  is  incontrovertible;  indeed  the  World  Health  Or¬ 
ganization  (WHO)  diagnostic  criteria  for  osteoporosis  rely 
primarily  on  densitometric  assessment.'101  The  same  logic 
underlies  investigations  based  on  instruments  that  assess 
lifestyle  factors,  comorbidities,  and  mobility,  and  there  is 
broad  agreement  in  the  community  regarding  the  risk  fac¬ 
tors  for  fracture.'1 1,121  However,  markers  of  fracture  risk  are 
variously  defined  and  although  related  are  nevertheless  dis¬ 
tinct.  This  is  readily  apparent  when  comparing  densitomet¬ 
ric  parameters;  cross-calibrating  fracture  risk  based  on  T  scores 
obtained  by  different  techniques  at  different  sites'13-191 
poses  an  important  patient  care  problem.  Similarly,  consid- 
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erable  overlap  exists  among  the  data  included  in  assess¬ 
ments  of  clinical  risk  factors  for  fracture,  but  cross¬ 
calibration  among  instruments  has  not  yet  been  achieved. 

Optimizing  predictive  ability  does  not  necessarily  ad¬ 
vance  mechanistic  understanding  of  fracture  biology.  Areal 
BMD  (aBMD)  measured  by  dual-energy  X-ray  absorptiom¬ 
etry  (DXA)  is  the  single  best  predictor  of  fracture  risk  at  the 
studied  site  (using  lateral  view  at  the  spine).  Scientists  have 
established  that  large  bones  have  higher  aBMDs  than  small 
bones  having  equal  vBMD(20)  because  of  projection  artifact. 
We  also  know  that  larger  bones  are  stronger  than  smaller 
bones  having  equivalent  tissue  strength.  By  combining  both 
size  and  mineral  density  into  a  single  measure,  aBMD 
achieves  better  fracture  prediction  than  vBMD  (in  which 
the  contribution  of  bone  size  is  explicitly  removed  from 
consideration).  Yet,  to  understand  the  underlying  bone  bi¬ 
ology,  we  must  treat  bone  size  and  BMD  as  distinct 
properties/21-235 

In  contrast,  biomechanical  studies'24,255  divide  perfor¬ 
mance  into  components  that  promote  investigation  of  mech¬ 
anisms.  The  most  basic  distinction  is  between  structural  and 
material  properties.  Structural  properties  are  possessed  by 
an  entire  bone  and  are  determined  jointly  by  its  anatomy  and 
its  tissue  strength.  Material  properties  are  those  of  the  bone 
tissue  per  se,  after  correction  for  differences  in  anatomy. 
Distinct  components  of  biomechanical  performance  also 
can  be  identified  in  interpreting  the  load-deformation  curve 
obtained  in  testing  a  specimen,  and  each  of  these  properties 
has  a  corresponding  material  parameter  obtained  by  normal- 
■  izing  for  anatomy.  Different  testing  schemes  yield  informa¬ 
tion  on  Qther  mechanicalproperties;  for  example,  repetitive, 
jqw-inteti’sity  loading  can  be  used  to  assess  fatigue.  Simi- 
.  iarly,  tKe;  ?r^ttomic  component  of  structural  strength  can  be 
divided  furfher;  the  distinction  between  cortical  and  trabec¬ 
ular  bone  reflects' this  division,  as  do  ongoing  efforts  to 
quantify  various  architectural  bone  properties. 

~  Biomechanical  studies  naturally  lead  to  efforts  to  analyze 
bone  strength  into  ever  more  refined  components,  but  such 
partition  is  not  always  achieved  in  practice.  Bone  is  an 
anisotropic  tissue,  and  its  biomechanical  performance  varies 
according  to  the  orientation  of  the  tissue  relative  to  the 
applied  load(26l27>;  this  is  particularly  evident  in  trabecular 
bone/28-305  Despite  these  limitations,  a  reductionists  pro¬ 
gram  does  advance  our  understanding  of  the  biomechanical 
properties  of  bone. 

The  reductionist  approach  embodied  by  biomechanics 
does  not  preclude  epidemiological  assessment.  Both  epide¬ 
miological  and  interventional  investigations  have  identified 
risk  factors  for  fracture  other  than  bone  mass.  Advancing 
age,(31,32)  past  personal  atraumatic  fracture  history/1 1,33-355 
and  maternal  hip  fracture  histoiy°2)  increase  fracture  risk. 
In  trials  of  alendronate  and  transdermal  estrogen,  the  frac¬ 
ture  benefit  exceeded  that  predicted  from  the  observed 
BMD  increase/36,375  In  contrast,  sodium  fluoride  treatment 
increases  BMD  but  does  not  improve  and  may  even  worsen 
fracture  outcomes/38,395  Taken  together,  these  data  show 
that  non-BMD  factors,  often  referred  to  collectively  as  bone 
quality,  contribute  to  the  material  strength  of  bone,  thereby 
linking  disparate  research  areas. 


LINKAGE  GENETICS  OF  BONE: 
PRESENT  STATUS 

Researchers  use  several  different  strategies  to  study  skel¬ 
etal  genetics.  Each  provides  a  potentially  valuable  approach 
and  addresses  somewhat  different  questions/405  The  linkage 
studies  considered  here  address  the  locations  of  genes  (loci) 
in  which  segregation  contributes  to  differences  in  the  trait  of 
interest  In  human  samples,  this  question  is  distinct  from 
that  of  identifying  which  allele  of  that  gene  is  responsible 
for  different  values  of  the  trait.  In  contrast,  the  existence  of 
inbred  mouse  lines  allows  investigation  of  both  loci  and 
alleles  to  be  accomplished  simultaneously/415 

In  human  samples  scientists  have  successfully  mapped  a 
locus  conferring  high  BMD  as  a  simple  Mendelian  trait/425 
Roller  et  al.  implicate  the  same  genomic  region  as  a  QTL 
contributing  to  less  dramatic  variation  in  BMD/43,445  Two 
other  human  genome  scans  reported  evidence  of  possible 
linkage  for  hip  and  spine'455  and  forearm/465  These  inves¬ 
tigations  found  evidence  for  a  QTL  on  chromosome  2p  in 
spite  of  the  different  sites  and  populations  studied.  Each  of 
these  studies  used  aBMD  as  the  phenotypic  endpoint,  based 
on  the  ease,  economy,  safety,  and  acceptance  of  aBMD  as  a 
surrogate  measure  of  skeletal  fragility. 

In  mice,  researchers  have  studied  a  broader  range  of 
phenotypes.  Klein  et  al/475  studied  whole  body  aBMD  in  the 
B  X  D  recombinant  inbred  lines.  Using  DXA  scanning, 
Benes  and  colleagues'485  examined  vertebral  aBMD  in  a 
pair  of  crosses  between  SAMP6  and  either  SAMR2  or 
AKR/J.  Shimizu  et  al/495  studied  femoral  cortical  thickness 
index,  the  fraction  of  diaphyseal  cross-section  occupied  by 
cortical  bone  on  plain  radiographs,  in  a  cross  between 
SAMP6  and  SAMP2.  Beamer  and  colleagues  have  reported 
two  related  studies:  a  cross  between  C57BL/6J  and  Cast/EiJ 
in  which  femoral  vBMD  was  mapped'505  and  a  study  of 
insulin-like  growth  factor  I  (IGF-1)  levels  in  a  cross  be¬ 
tween  C57BL/6J  and  C3H/HeJ/515  Humeral  biomechanical 
performance  and  dimensions,  ash  percentage,  and  body 
mass  in  the  HcB/Dem  recombinant  congenic  strains  were 
assessed  by  Yershov  et  al/525 

.  In  this  issue  of  the  JBMR,  Beamer  et  al.  present  the  results 
of  mapping  femoral  and  vertebral  vBMD  in  a  cross  between 
C57BL/6J  and  C3H/HeJ.05  Their  results  are  notable  for 
several  reasons.  First,  a  subset  of  the  QTLs  contributes  to 
vBMD  at  both  sites  studied,  while  others  contribute  to 
vBMD  at  only  one  site,  providing  a  biological  context  for 
the  clinical  finding  of  disparate  BMDs  at  different  sites  in 
the  same  individual.  Second,  in  several  cases  the  same 
QTLs  have  been  mapped  in  different  crosses,  using  different 
but  related  phenotypes.  This  provides  compelling  evidence 
that  a  bone  mass  gene  in  fact  has  been  identified.  Third,  the 
number  of  animals  studied  in  this  cross  enabled  testing  for 
pairwise  epistatic  interactions,  which  were  not  found.  It  is 
too  soon  to  say  that  epistasis  does  not  contribute  signifi¬ 
cantly  to  vBMD  variation  in  general,  but  these  efforts  re¬ 
mind  us  that  we  must  investigate  gene-gene  and  gene- 
environment  interactions.  Fourth,  the  authors  are 
unequivocal  in  stating  their  ambition  to  help  clarify  our 
understanding  of  the  mechanisms  underlying  bone  pheno¬ 
types. 
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Successfully  mapping  QTLs  represents  an  early  step  in 
understanding  the  genetic  basis  of  skeletal  function.  With 
the  approximate  locations  of  some  vBMD  QTLs  identified, 
we  must  face  the  problem  of  determining  which  of  the 
positional  candidates  thus  defined  are  the  relevant  gene(s). 
Similar  research  efforts  in  other  diseases,  notably  type  I 
diabetes(53-55)  and  other  autoimmune  diseases, (56-58)  sug¬ 
gest  that  successfully  subdividing  the  overall  phenotype  into 
pathophysiologically  relevant  intermediate  phenotypes  is 
one  of  the  major  tasks  that  lies  ahead.  Subdivision  of  traits 
may  lead  to  phenotypes,  the  genetics  of  which  are  Mende- 
lian  rather  than  quantitative,  greatly  simplifying  their  ge¬ 
netic  analysis,  as  illustrated  by  the  high  bone  mass  locus/42’ 
Even  if  we  do  not  succeed  in  reducing  traits  to  oligogenic 
phenotypes,  the  effort  undoubtedly  will  stimulate  new  ideas 
about  how  whole  bone  anatomy  is  established,  how  bone 
responds  to  mechanical  loading,  how  matrix  proteins  and 
minerals  interact,  and  how  bone  metabolism  changes  during 
development  arid  aging.  Successful  integration  of  all  these 
research  areas  into  genetic  investigations  of  fracture  risk 
should  enable  us  to  choose  among  the  myriad  of  potential 
intermediate  phenotypes.  In  addition,  new  instruments  and 
new  analytical  methods  provide  opportunities  to  define  ad¬ 
ditional  phenotypes. 150  “62) 

Work  to  date  has  already  used  a  variety  of  different,  albeit 
related,  phenotypes.  Intermediate  phenotypes  will  add  more 
traits  into  the  mix  and  present  us  with  the  additional  chal¬ 
lenge  of  relating  the  phenotypes  to  each  other.  If  we  design 
,  future  genetic  experiments  to  include  a  variety  of  traits,  we 
-  tiaay  discover  evidence  of  pleiotropy,  that  is,  a  single  gene 
V-  •  conffomng  multiple  phenotypes/52,63,64’  The  data  presented 
,’Oby  Beamer  and  colleagues0’  contribute  to  this  effort  by 
distinguishing  between  QTLs  acting  at  both  femur  and  spine 
and  QTLs  acting  in  a  site-specific  manner. 

Finally,  progress  in  genomics  favors  investigations  based 
on  interesting  and  insightful  biology.  Recent  completion  of 
the  human  genome  sequence165’  and  the  pending  completion 
of  the  mouse  and  rat  sequences  will  devalue  some  of  the 
skills  geneticists  have  mastered.  High-density  maps  and 
microarray  technologies  will  allow  adoption  of  multigen- 
erational  association  analyses  having  greater  statistical 
power  than  traditional  linkage  methods/66’  These  advances 
demand  that  the  bone  geneticists  of  the  future  have  com¬ 
mand  of  bone  biology.  The  article  by  Beamer  et  al/1’ 
provides  a  first  step  in  achieving  this  goal. 
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ABSTRACT 

OBJECTIVE 

Development  of  a  systematic  mutation  detection  assay  strategy  for  denaturing  high  performance  liquid  chromatography 
(DHPLC). 

DESIGN 

Adaptation  of  Guanine  and  Cytosine  (GC)-clamping  from  denaturing  gradient  gel  electrophoresis  (DGGE)  to  DHPLC. 

METHODS 

.Three  target  sequences. harboring  known  allelic  variants  were  studied  to  develop  a  general  DHPLC  assay  design  strategy. 
Th,ese  were  exon  10  of  the  human  /?£T(REarranged  during  Transfection)  gene,  exon  52  of  the  mouse  Colla2  gene,  and  exon 
•9  of*  the'human  FAS  (APO-1,  CD-95)  gene.  Available  software  was  used  to  analyze  melting  curves  and  determine  assay  condi- 
.  bqrts.‘  GtLcIsmps’ of  20  bp  or  36  bp  were  added  to  polymerase  chain  reaction  (PCR)  primers  to  introduce  a  high  melting  tem- 

’ .;  perature  (T^)  domain  to.each  of  the  target  molecules.  DHPLC  was  performed  under  partially  denaturing  conditions. 

RESULTS  \ 

'DHPLC  assays  of  PCR-amplified  sequences  can  be  developed  using  a  personal  computer.  The  following  three  steps  allowed 
for  mutation  detection  in  all  three  targets. 

1)  The  target  sequence  should  have  a  uniform  Tm 

2)  GC  clamps  of  length  sufficient  to  introduce  a  second  melting  domain  with  a  Tm  >  8°  above  that  of  the  target 
sequence  should  be  appended  to  one  of  the  primers. 

3)  The  DHPLC  assay  should  be  performed  at  the  highest  temperature  at  which  the  target  sequence  is  predicted  to 
be  >  90%  double  stranded 

CONCLUSION 

Addition  of  GC-clamps  to  primers  facilitates  mutation  detection  by  DHPLC. 

The  theoretical  basis  for  this  observation  is  identical  to  that  underlying  the  utility  of  GC-clamps  in  DGGE. 
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INTRODUCTION 


Efficient,  robust  mutation  detection  methods  can  potentially 
have  a  major  impact  on  the  diagnosis  of  genetic  disorders 
and  the  identification  of  genetic  contributions  to  multifacto¬ 
rial  disorders.  Many  investigations  depend  on  relating  geno¬ 
typic  variations  to  specific  phenotypes.  Such  assays  should 
ideally  possess  several  widely  recognized  performance  char¬ 
acteristics.1^ 

This  report  describes  a  simplified  strategy  for  designing 
mutation  detection  assays  by  denaturing  high  performance 
liquid  chromatography  (DHPLC),  based  on  DHPLC’s  ability 
to  separate  partially  denatured  double  stranded  DNA 
molecules.  Particular  attention  is  devoted  to  the  use  of  GC 
clamps  with  this  system,  allowing  assay  design  to  be  carried 
out  in  a  nearly  algorithmic  fashion.  This  approach  allows 
investigators  to  shift  their  efforts  from  mutation  detection 
assay  design  to  interpretation  of  the  biological  consequences 
of  detected  sequence  variation. 

MATERIALS  AND  METHODS 

DNA  preparation 

Mouse  and  human  DNA  were  prepared  from  tail  and  periph¬ 
eral  blood  mononuclear  cells  using  the  Invitrogen  (Carlsbad, 
CA)  and  Puregene  (Minneapolis,  MN)  DNA  extraction  kits, 
respectively.  Human  subjects  provided  written  informed  con¬ 
sent  under  protocols  approved  by  the  Memorial  Sloan- 
Kettering  Cancer  Center’s  IRB3>4  or  the  Hospital  for  Special 
Surgery’s  IRB.5'7 

PCRxj r 

All  amplificatiftps  Were  performed  in  50  pL  reactions  includ- 
.  ing  l  U'of  ReftTaq  DNA  Polymerase  (Sigma,  St  Louis, 
MG),'friTmMffN.TP  mix  (Amersham  Pharmacia, 

Piscataway,  NT),  and  10  mM  Tris-HCl  (pH  8.3)  containing  50 
mM  KC1, 1.5  mM  MgC^,  0.1%  gelatin,  and  20  to 
50  ng  of  genomic  DNA.  Primers  were  designed  using  Primer 
0.5  software.8  Reactions  were  performed  in  a  PE  Biosystems 
(Norwalk,  CT)  2400  thermal  cycler.  Amplifications  were 
carried  out  by  an  initial  denaturation  of  3  minutes  at  95°C, 
followed  by  35  cycles  of  94°C  x  30  seconds,  Tannealing  x  30 
seconds,  and  72°C  x  30  seconds,  and  concluded  by  a  final 
denaturation  at  95°C  x  5  minutes  followed  by  slow  cooling 
to  room  temperature.  Primer  sequences  and  annealing  tem¬ 
peratures  are  as  follows: 
clamp  20  =  GCGGCCCGCCGCCCCCGCCG 
clamp  36  =  CGCCCGCCGCGCCCCGCGCCCGTCCCGC- 
CGCCCCCG9,  FOIM  =  GAAATGGCTTTCCTAGACC- 
CCG,  ROIM  =  AATGATTGTCTTGCCCCATTCA  with 
Tannealing  =  60°C,  primers  HFAS47  and  HFAS495  and 
primers  FRET10  and  RRET1010  and  their  annealing  temper¬ 
atures  were  previously  published. 

DHPLC  conditions 

Loading,  elution  and  washing  of  the  DHPLC  column  was 
carried  out  with  varying  combinations  of  three  buffers 
injected  at  a  flow  rate  of  0.9  ml/min:  Buffer  A  contains 
100  mM  triethylamine  acetate  (TEAA),  pH  7.0  and  0.025% 


acetonitrile,  Buffer  B  contains  25%  acetonitrile,  100  mM 
TEAA,  pH  7.0,  and  0.1  mM  EDTA,  and  Buffer  D  contains 
75%  acetonitrile.  Loading  and  elution  buffers  are  combina¬ 
tions  of  buffers  A  and  B,  whose  relative  proportions  form  a 
gradient  over  a  specified  time  interval.  Buffer  D  is  used  to 
wash  the  column.  DHPLC  elution  buffer  gradients  were  gen¬ 
erated  by  WAVEMAKER  version  3.3.3  software 
(Transgenomic,  Omaha,  NE)  and  are  reported  as  %  buffer  B 
at  specified  times.  Assays  were  performed  using  the  WAVE 
DNA  fragment  analysis  system  (Transgenomic,  Omaha, 

NE).  Oven  temperature  was  determined  from  inspection  of 
the  melting  profile,  choosing  the  highest  temperature  at 
which  the  target  sequence  was  predicted  to  be  >90%  duplex. 
Following  the  gradient  elution,  all  remaining  bound  material 
was  washed  from  the  column  for  36  seconds  with  buffer  D 
and  the  column  was  re-equilibrated  with  the  loading  buffer 
for  156  seconds.  Sample  elution  was  monitored  by 
absorbance  at  260  mn. 

DNA  Sequencing 

PCR  products  were  purified  by  passage  through  Microcon 
microconcentrating  centrifugal  filter  columns  (Millipore, 
Bedford,  MA)  prior  to  sequencing.  Sequencing  reactions 
were  performed  using  the  Applied  Biosystems  Dye- 
Terminator  Kit  and  analyzed  on  a  ABI  Prism  377  DNA 
sequencer  (Applied  Biosystems,  Foster  City,  CA). 

RESULTS 

Fust,  an  established  denaturing  gradient  gel  electrophoresis 
(DGGE)  assay  to  DHPLC  was  adapted,  since  both  DGGE 
and  DHPLC-based  mutation  detection  operate  on  the  princi¬ 
ple  of  detecting  differences  in  melting  behavior  among  indi¬ 
vidual  species  in  a  mixed  population  of  DNA  molecules. 
Previously  a  GC-clamped  DGGE  assay  was  developed  to 
detect  pathogenic  mutations  in  exon  10  of  the  RET  pro¬ 
tooncogene,3’4  this  was  used  as  the  starting  point  in  adapting 
GC-clamping  to  DHPLC.  Without  a  GC  clamp,  the  target 
sequence  has  a  single,  uniform  melting  domain,  with  a  pre¬ 
dicted  Tm  between  66°C  (>90%  double  stranded)  and  67°C 
(>80%  single  stranded).  Addition  of  a  36  bp  GC  clamp 
introduces  a  second  higher  Tm  melting  domain  (figures  1 A 
and  IB).  The  target  sequence  therefore  denatures  under  con¬ 
ditions  in  which  the  GC  clamp  remains  double-stranded.  The 
WAVEMAKER-generated  elution  profile  was  loading  in 
44%  buffer  B,  49%  buffer  B  at  30  seconds,  and  58%  buffer 
B  at  300  seconds.  The  software  recommended  67°C  for 
assay  performance.  RET  exon  10  was  amplified  from  sub¬ 
jects  harboring  the  C620F  and  C620R  mutations,  suffering 
from  familial  medullary  carcinoma  of  the  thyroid  and  MEN 
2A,  respectively,  and  from  unaffected  family  members.4 
Mutations  were  detected  at  66°C  (figure  1C)  and  67°C  (not 
shown),  but  not  at  65°C  or  68°C  (not  shown)  when  the  GC 
clamp  was  included.  At  65°C  a  single  sharp  peak  was  seen, 
regardless  of  genotype,  while  at  68°  all  genotypes  eluted  as 
a  broad,  low  intensity  peak.  Resolution  of  heteroduplex 
peaks  was  clearer  at  66°C  than  at  67°C.  These  mutations 
were  resolved  at  60°C  using  a  20%  to  60%  gradient  by 
DGGE.3  Mutations  were  not  detected  either  by  DHPLC  or 
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Figure  1. 

A.  Wavemaker-generated  melting  profiles  for  RET  exon  1 0  without  (top)  and  with 
(bottom)  inclusion  of  a  36  base  GC  clamp  on  the  3'  amplification  primer.  The  bold 
lines  under  the  base  pair  axis  indicate  the  extent  of  the  primer  sequences. 

B.  Predicted  melting  behavior  of  GC  clamped  RET10  amplicons  at  65°C,  66°C, 
and  67°C.  The  GC  clamp  remains  double  stranded  at  all  3  temperatures.  The  target 
sequence  is  >  90%  double  stranded  at  65°C  and  66°C,  but  not  at  67°C. 

C.  CDHPLC  elution  profiles  for  3  RET  genotypes  at  66°.  Heterozygotes  harboring 
the  C620F  and  C620R  mutations  are  readily  distinguished  from  normals  and  from 
each  other. 


DGGE  when  the.GC  clamp  was  not  used  (not  shown).  At 
65°C  and  66°C,  unclamped  samples  elute  as  a  single  sharp 
.  peak  regardless  of  genotype,  while  at  67°C  and  68°C  they 
|  elute  as  a  broad,  low-amplitude  peak  regardless  of  genotype. 

gNext  we  designed  a  mutation  detection  assay  for  the  mouse 
,  o  mutation;  a  one-base  pair  deletion  of  a  G  residue 
*?exon  52,  encoding  the  c-terminal  propeptide  of  tire  oc2 
Betm  *  procollagen.1  b12  The  melting  profile  was 
ermine d  to  be  uniform  in  the  target  region  (figure  2A). 
1?^®t  sequence  is  >90%  double  stranded  at  58°C  but 
|at  59aC,  so  assays  were  conducted  at  58°C  (figure  2B). 
|,ient  conditions  were  injection  in  43%  buffer  B,  48% 

If  B  at  30  seconds,  and  57%  buffer  B  at  300  seconds, 
^zygotes  were  readily  distinguished  from  either  wild- 


type  or  mutant  homozygotes  when  a  36  bp  GC  clamp  was 
included  (figure  2C),  but  not  when  it  was  omitted  (not 
shown).  As  in  the  case  of  RET  exon  10,  the  unclamped  prod¬ 
ucts  all  elute  as  single  sharp  peaks  at  58°C,  a  temperature  at 
which  the  molecule  is  predicted  to  be  double  stranded.  We 
did  not  attempt  to  resolve  this  target  at  other  temperatures.  A 
50:50  mix  of  wild-type  and  mutant  homozygous  samples 
eluted  as  a  single  sharp  peak  when  unheated,  but  eluted  as  a 
pair  of  peaks  when  heated  to  95°C  and  cooled  slowly  (figure 
2D),  This  demonstrates  that  assay  conditions  are  robust  for 
detecting  heterozygosity,  but  are  not  adequate  for  determin¬ 
ing  the  allele  present  in  homozygous  samples.  Moreover,  the 
behavior  of  the  unheated  mixture  reveals  that  there  is  no  dif¬ 
ference  in  the  elution  times  of  the  wild-type  and  mutant 
products,  as  the  peak  is  not  broadened  relative  to  the  peak 
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Figure  2. 

A.  Wavemaker-generated  Col 7 a2  GC  clamped  melting  profile.  The  bold  lines  under 
the  base  pair  axis  indicate  the  extent  of  the  primer  sequences. 

B.  The  predicted  melting  behavior  of  Col1a2  amplicons  at  57°C,  58°C,  and  59°C. 

C.  Resolution  of  Col1a2  heterozygotes  from  homozygotes.  A  Col1a2oim/'+  het¬ 
erozygote  is  separated  from  either  wild  type  or  oim/oim  homozygotes.  A  denatured 
and  slowly  reannealed  mixture  of  the  wild  type  and  oim/oim  PCR  products  pro¬ 
duces  an  elution  profile  identical  to  that  of  the  oim/+  heterozygote. 

D.  Inability  to  resolve  +/+  homozygotes  from  oim/oim 

homozygotes.  Both  homozygotes  and  an  unheated  mixture  of  oim/oim  and  +/+ 

PCR  products  result  in  single  elution  peaks.  However,  heating  the  mixture  to  dena¬ 
ture  the  DNA  and  allowing  it  to  reannea!  slowly  allows  heteroduplex  formation 
and  results  in  the  appearance  of  a  second  elution  peak,  as  for  oim/+  heterozygotes. 

generated  by  homozygous  samples.  This  finding  indicates  Homozygous  samples  can  be  genotyped  in  a  two-step  assay. 

that  decreased  Tm  at  the  site  of  mismatch  is  a  critical  ele-  1)  The  assay  is  performed  on  a  pure  sample 

ment  of  mutation  detection  by  DHPLC.  2)  Samples  found  to  be  homozygous  are  mixed  with  a 

reference  wild-type  sample,  denatured,  reannealed  and 
1  reanalyzed 
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Mutant  homozygotes  will  yield  a  heterozygous  pattern  on 
mixing,  while  wild-type  homozygotes  will  retain  a  homozy¬ 
gous  pattern  on  mixing. 

The  same  strategy  was  applied  for  the  detection  of  the 
D244G  and  R234P  missense  mutations  in  exon  9  of  the 
human  FAS  (. APO-1 ,  CD95)  gene,  which  cause  the  Canale- 
Smith  Syndrome5’7.  The  melting  profile  predicts  that  59°C 
is  the  highest  temperature  at  which  the  target  sequence  is 
>90%  double  stranded  (figures  3A  and  3B).  DHPLC  analy¬ 
sis  at  59°C  with  injection  at  45%  buffer  B  and  elution  with  a 
gradient  profile  of  50%  buffer  B  at  30  seconds,  and  59%  at 
300  seconds  is  shown  in  figure  3C.  The  mutations  were 
readily  detected  with  a  20  bp  GC  clamp,  while  omission  of 
the  GC  clamp  did  not  allow  us  to  detect  the  mutations  under 
these  conditions,  with  all  genotypes  eluting  as  a  single  sharp 
peak  (not  shown). 


DISCUSSION 

Sheffield  and  colleagues  demonstrated  the  utility  of  GC 
clamps  in  designing  DGGE  assays,9  showing  that  the  inclu¬ 
sion  of  a  clamp  allowed  detection  of  mutations  in  the  murine 
pmajor  giobin  gene  that  were  undetectable  without  clamping. 
The  additional  sensitivity  arose  from  detection  of  mutations 
in  the  highest  Tm  domain  of  the  target  sequence.  Subsequent 
sensitivity  analyses  of  DGGE  have  consistently  found  that 
analyses  performed  with  GC  clamps  have  sensitivities  of 
~95%  and  specificities  approaching  unity.1 3-16  The  theoreti¬ 
cal  basis  for  this  increase  in  sensitivity  is  that  resolution  of 
the  various  molecular  species  occurs  primarily  when  individ¬ 
ual  molecules  are  partially  denatured.  Inclusion  of  a  GC 
clamp  provides  an  artificial,  high  Tm  domain  in  the 
molecules  being  analyzed,  allowing  the  target  sequence  to 
occur  in  the  context  of  a  low  Tm  domain.  The  data  suggest 
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Figure  3. 

A.  Wavemaker-generated  FAS  exon  9  CC  clamped  melting  profile. 

B.  Temperature  and  FAS  Exon  9  Melting.  The  predicted  melting  behavior  of  FAS 
amplicons  at  58°C,  59°C  and  60°C. 

C.  Resolution  of  R234P  and  D244G  heterozygotes  from  a  normal  homozygote. 
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that  resolution  of  the  various  molecular  species  by  DHPLC 
also  occurs  primarily  when  these  have  partially  denatured,  so 
that  the  principles  guiding  design  of  DGGE  assays  can  be 
applied  to  DHPLC  mutation  detection. 

Heteroduplexes  include  mismatches  of  one  or  more  base 
pairs,  resulting  in  early  melting  of  the  mismatched  region. 
DHPLC  allows  resolution  of  heteroduplex  from  homoduplex 
DNA  molecules  based  on  differences  in  their  retention  time 
in  the  column  under  partially  denaturing  conditions,  with 
heteroduplex  molecules  eluting  earlier.  This  behavior  mir¬ 
rors  heteroduplexes’  lower  Tm’s  and  is  similar  to  retardation 
of  electrophoretic  transport  at  different  points  along  a  dena¬ 
turing  gradient  gel.  In  DGGE,  mutations  in  the  highest  Tm 
domain  are  resolved  poorly,  if  at  all.  It  was  determined  that 
while  the  analytical  technology  differs  in  DHPLC,  the  prin¬ 
ciple  of  heteroduplex  detection  should  be  the  same  as  in 
DGGE  and  that  inclusion  of  GC  clamps  would  facilitate 
mutation  detection.  The  inability  to  detect  sequence  variants 
in  the  highest  Tm  domain  may  explain  the  anecdotally  poor 
ability  of  DHPLC  to  detect  mutations  in  short  amplicons  (K. 
Hecker,  personal  communication),  which  often  include  only 
a  single  melting  domain. 

A  basic  and  general  approach  to  designing  new  mutation 
screening  assays  by  DHPLC  was  established.  First,  the  target 
region’s  sequence  is  evaluated  for  uniformity  of  Tm.  If  this 
condition  is  not  satisfied,  alternative  primers  are  chosen  in 
order  to  generate  a  target  sequence  with  a  uniform  Tm. 
Second,  a  GC  clamp  of  sufficient  length  to  create  a  high-Tm 
domain  is  appendedto  one  of  the  primers.  Once  again,  the 
predicted  melting  curve  is  inspected,  to  ensure  that  the  ' 
‘clamp  produnesfthe  desired  high  Tm  domain,  as  illustrated  in 
"  the  “A”;pan0ls.of  the  figures.  Melting  curves  of  the  PCR 
product  are  calculated  and  the  highest  temperature  at  which 
thg  target  sequence  is  >90%  duplex  is  chosen  for  assay  per¬ 
formance,  as  illustrated  in  the  “B”  panels  of  the  figures. 
WAVEMAKER  software,  native  to  our  DHPLC  instrument, 
allowed  both  the  calculation  of  melting  profiles  and  genera¬ 
tion  of  elution  profiles.  When  using  this  software,  inspection 
of  the  melting  profiles  is  necessary.  WAVEMAKER  some¬ 
times  suggests  an  assay  temperature  that  is  too  high  for  opti¬ 
mal  resolution  of  sequence  variation.  Freely  accessible  pro¬ 
grams  perform  equivalent  calculations  and  give  equivalent 
results.17'20 

The  trio  of  assays  described  here  demonstrate  that  inclusion 
of  a  GC  clamp  allows  detection  of  mutations  by  DHPLC 
under  conditions  that  can  be  fully  established  prior  to  assay 
performance,  using  predicted  melting  curves.  The  experi¬ 
ments  presented  above  do  not  exclude  the  development  of 
assays  without  GC  clamps,  but  rather  show  that  the  condi¬ 
tions  identified  in  silico  work  well  without  further  optimiza¬ 
tion  when  GC  clamps  are  included.  Unclamped  primers  for 
RET  exon  10  were  tested  at  four  temperatures.  The  other  two 
assays  were  tested  at  a  single  temperature.  We  did  not 
attempt  to  optimize  gradient  conditions.  It  was  expected  that 
such  optimization  would  likely  have  allowed  mutation  detec¬ 
tion,  even  with  unclamped  molecules. 


In  the  assays  described  here,  the  GC  clamp  had  a  melting 
point  8°C  to  13°C  higher  than  the  target  sequence,  while  the 
target  sequence  varied  in  Tm  by  no  more  than  3°C. 
Therefore,  investigators  should  include  GC  clamp  sequences 
sufficiently  long  to  provide  an  8°C  increment  in  Tm  relative 
to  the  target.  The  required  length  will  vary  according  to  the 
Tm  of  the  target  sequence,  but  it  is  straightforward  to  deter¬ 
mine  the  minimum  necessary  length  with  melting  profile 
software  prior  to  ordering  modified  primers. 

Theoretically,  psoralen  clamping  provides  an  alternative  to 
GC  clamping  by  generating  a  covalent  interstrand  bond 
rather  than  by  raising  the  Tm  of  a  region  of  the  target 
molecule.21  The  covalent  bond  introduced  by  psoralenation 
and  subsequent  UV  light  exposure  effectively  clamps  one 
end  of  the  molecule  regardless  of  temperature.  The  advan¬ 
tage  of  psoralen  compared  to  GC  clamping  is  that  the  clamp 
need  not  be  changed  from  assay  to  assay;  the  disadvantage  is 
that  an  additional  UV  crosslinking  step  must  be  added  prior 
to  analysis.  However,  experiments  with  psoralen  clamps 
were  not  conducted,  as  a  result  it  is  not  known  if  this 
approach  would  perform  as  well  in  practice  as  in  theory. 

Electrophoretic  mutation  detection  assays  are  difficult  to 
perfonn  on  a  large  scale,  since  casting  and  loading  gels  are 
tasks  that  are  difficult  to  automate.  Because  DGGE  and 
Single  Strand  Conformation  Polymorphism  (SSCP)  suffer 
from  these  limitations,  the  DHPLC-based  assay  is  more 
readily  adaptable  to  even  a  medium-throughput  setting.  The 
WAVE  system  can  load  samples  automatically  from  a  96- 
sample  block.  Each  sample  is  processed  in  approximately  6 
to  9  minutes.  The  automation  provided  by  the  WAVE  system 
vs.  the  gel-based  methods,  allows  investigators  to  perform 
mutation  screens  at  higher  throughput  and  with  less  hands- 
on  time.  Further  improvements  in  throughput  might  be 
achieved  by  pooling  samples  prior  to  DHPLC  analysis,  as 
has  been  done  for  gel-based  methods.22  These  scale  issues 
are  most  relevant  to  research  laboratories  engaged  in  muta¬ 
tion  screening  and  genotype-phenotype  correlation,  and  may 
become  applicable  to  diagnostic  clinical  laboratories  as  well. 

The  assays  we  describe  here  scan  relatively  short  target 
sequences  and  therefore  cannot  estimate  the  maximum 
length  over  which  DHPLC  based  methods  might  work.  High 
sensitivity  and  specificity  have  been  reported  for  DHPLC  in 
a  growing  body  of  analyses  for  sequences  up  to  400  bp  in 
length.23'32  Our  approach  to  primer  design  has  a  shorter 
practical  limit  of  about  250  bp,  imposed  by  our  criterion  that 
target  sequences  have  a  uniform  Tm. 

Other  approaches  may  lead  to  further  improvements  in 
mutation  scanning  methodology.  For  example,  two  groups 
have  recently  described  a  purely  optical  assay  strategy  that 
obviates  the  need  for  physical  separation  of  homoduplex  and 
heteroduplex  molecules.33’34  This  approach  may  prove 
applicable  to  large-scale  genotyping  applications,  but  has  yet 
to  be  widely  validated. 
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The  mutation  detection  strategy  presented  in  this  article 
leads  to  higher  primer  costs  than  other  approaches,  because 
of  the  need  for  either  GC-clamped  or  psoralen-clamped 
primers,  and  shorter  target  sequences.  At  an  estimated  incre¬ 
mental  cost  of  $0. 70/base  x  30  bases  for  “ordinary”  scale 
primer  synthesis,  the  extra  primer  costs  for  a  typical  length 
GC  clamp  is  about  $21.  Psoralen  clamps  are  more  expensive 
than  GC-clamps,  requiring  both  an  arbitrary  primer  adaptor 
arm  and  the  psoralenation,  thus  adding  about  $40  to  the  cost 
of  each  clamped  primer.  Theoretically,  psoralen  primers 
could  be  reserved  for  the  most  GC-rich  targets,  in  which  GC 
clamps  of  practical  length  would  not  impact  Tm.  Costs  aris¬ 
ing  from  extra  primer  pairs  needed  as  a  result  of  studying 
shorter  target  sequences  are  less  easily  quantified  and 
depend  in  part  on  the  scope  of  the  length  of  DNA  to  be 
scanned  for  mutations  and  the  nature  of  its  sequence.  At  an 
average  cost  of  $50/GC  clamped  primer  pair,  addition  of  10 
additional  primer  pairs  leads  to  a  relatively  small  incremen¬ 
tal  cost.  These  are  trivial  costs  compared  with  the  time  of 
technical  personnel,  whose  salaries  will  typically  exceed 
$  10/hour.  Therefore,  even  modest  reductions  in  assay  devel¬ 
opment  time  lead  to  net  cost  savings. 


The  greatest  limitation  on  mutation  scanning  throughput, 
however,  remains  the  time  necessary  to  design  mutation 
detection  assays.  Several  previous  reports  addressed  DHPLC 
assay  design  optimization.  18, 30,31,3s, 36  These  authors  all  rec¬ 
ommend  performing  assays  at  various  temperatures  to  maxi¬ 
mize  sensitivity  and  some  narrow  the  buffer  gradient  ranges, 
requiring  a  still  greater  number  of  temperatures.  Another 
report37  suggests  that  a  gradual  decrease  in  Tm  from'  the 
clamp  to  the.free  eiiri  dt  a  molecule  is  the  optimal  configu- 1 
ration  for  muta^ioijrdetection  by  heteroduplex-based  meth- 
'  ods:  We  tiave  deiijonstrated  that  inclusion  of  GC  clamps 
obviates  the  need,  for  tedious  assay  optimization.  GC  clamps 
of  sufficient  length  to  create  a  domain  with  Tm  8°C  greater 
than  that  of  the  target  sequence  allows  an  experimenter  to 
develop  a  working  assay  in  less  than  one  hour.  As  genomic 
analysis  moves  from  determination  of  additional  sequence 
data  to  understanding  the  biological  consequences  of 
sequence  variation,  the  ability  to  perform  mutation  screening 
efficiently  will  become  ever  more  valuable.  The  algorithm 
for  establishing  DHPLC  conditions  described  here  will  facil- 
itate  this. 

CONCLUSION 

i  Assay  design  principles  adapted  from  those  previously 
&  established  for  DGGE  facilitate  development  of  DHPLC 
piutation  scanning  strategies,  as  both  methods  resolve 
(Sequence  variants  by  virtue  of  differences  in  the  melting 
Iu_avi°r  °f  partially  denatured  DNA  molecules.  Use  of  GC 
e  ’ m  DHPLC  obviates  the  need  for  empirical  optimiza- 
ifon°  f  6W  assa^s'  use  of  clamps  is  preferable  to  evalu- 
®  °  a  greater  number  of  assay  conditions  for  two  rea- 
7  lrst>  unlike  the  situation  with  model  assays  such  as 
kti^6  ^aV£  USe<^  ^or  ^lustration,  investigators  performing 
^screens  will  not  have  positive  controls  available  to 
I. n  tl0ns-  Second,  the  alternative  strategy  of  varying 


conditions  to  increase  sensitivity  depends  primarily  on  vary¬ 
ing  assay  temperature.  Heating  and  cooling  the  instrument 
oven  is  slow,  and  the  time  needed  for  temperature  equilibra¬ 
tion  eliminates  much  of  the  time  advantage  arising  from 
DHPLC. 
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Spectroscopically  Determined  Collagen  Pyr/deH-DHLNL 
Cross-Link  Ratio  and  Crystallinity  Indices  Differ  Markedly 
in  Recombinant  Congenic  Mice  with  Divergent 
Calculated  Bone  Tissue  Strength 
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.  •  tYh'ole  bone  strength  can  be  partitioned  into  structural  and 
material  components.  In  three-point  bending  tests  of  6-month -old 
female  humeri  from  the  HcB/Dem  recombinant  congenic  series, 
sfrains  HcB/8  ahd  HcB/23  differed  markedly  in  calculated  failure 
stress  but  not  ash  percentage.  Fourier  transform  infrared  spec¬ 
troscopic  imaging  was  used  to  determine  whether  differences  in 
the  ratio  of  pyridinoline  (pyr;  nonreducible)  to  dehydrodihydrox- 
ynorleucine  (de-DHLNL;  reducible)  collagen  cross-links  (XLR), 
mineral  crystallinity,  or  spatial  ordering  could  account  for  the 
strains’  differing  biomechanical  performance.  HcB/8  had  signifi¬ 
cantly  higher  XLR  and  significantly  higher  crystallinity  than 
HcB/23.  XLR  and  crystallinity  were  highly  and  similarly  correlated 
in  both  strains.  There  were  no  significant  differences  between  the 
strains’  one-dimensional  Spatial  correlation  functions,  suggesting 
no  difference  in  short-range  order  between  them.  The  strong  corre¬ 
lation  between  XLR  and  crystallinity  reflects  the  interdependence 
of  the  protein  and  mineral  elements  of  bone.  The  data  illustrate  the 
importance  of  material  properties  in  addition  to  mineral  quantity 
to  bone  tissue  strength. 


Keywords  Bone  Quality,  Collagen,  Crystallinity,  FTIR,  Mouse. 
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INTRODUCTION 

Fracture  is  the  most  significant  clinical  measure  of  bone’s 
biomechanical  performance.  Epidemiologically,  several  types  of 
fracture  are  recognized  as  “osteoporotic,”  occurring  in  the  set¬ 
ting  of  minimal  trauma.  However,  bone  mineral  density  (BMD), 
while  a  useful  clinical  predictor  of  fracture  risk,  does  not  fully 
account  for  individual  differences  in  fracture  risk  [1-6]. 

Biomechanical  testing  of  bones  harvested  from  experimental 
organisms  is  one  approach  by  which  the  various  contributors  to 
bone  strength  can  be  studied  [7,  8],  By  loading  bones  accord¬ 
ing  to  contrived  but  reproducible  protocols,  such  tests  are  ca¬ 
pable  of  distinguishing  strain-specific  differences  in  mechanical 
performance.  Moreover,  it  is  well  established  that  the  structural 
strength  of  a  whole  bone  can  be  partitioned  into  architectural  and 
material  components.  The  bone’s  architecture  takes  into  account 
the  bone’s  size  and  shape,  while  the  bone’s  material  reflects  the 
underlying  strength  of  the  tissue. 

Recently,  interest  has  been  growing  in  understanding  the  fac¬ 
tors  contributing  to  whole  bone  strength,  which  is  determined 
by  tine  inherent  material  strength  of  the  tissue,  its  size  and  geom¬ 
etry,  and  its  architecture.  Geometrical  and  architectural  investi¬ 
gations  have  included  human  studies  of  hip  axis  length  [9-15], 
femoral  neck  bone  distribution  [16,  17],  crumpling  ratio  [18], 
sexual  dimorphism  of  bone  modeling  [19-22],  and  trabecular 
connectivity  [23-26].  In  addition,  in  animal  models  there  has 
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been  explicit  use  of  anatomical  measurements  in  the  interpre¬ 
tation  of  biomechanical  studies  [27-30].  Investigation  of  bone 
material  strength  historically  has  relied  upon  biochemical  and 
biophysical  methods  and  has  included  quantitation  of  various 
protein  and  mineral  components  of  bone  tissue  through  a  va¬ 
riety  of  technical  approaches.  Recently,  by  coupling  a  Fourier 
transform  infrared  (FTTR)  spectrometer  with  a  microscope  and 
an  array  detector,  spectral  analysis  has  been  carried  out  on  thin 
histologic  preparations  of  bone,  allowing  the  material  proper¬ 
ties  of  bone  to  be  investigated  on  scales  of  6-7  /r m  [31,  32]. 
These  methods  allow  investigation  of  the  role,  if  any,  the  mi¬ 
croscopic  organization  of  bone  tissue  plays  in  determining  its 
material  strength. 

In  this  article,  we  extend  previous  observations  of  biome¬ 
chanical  performance  in  HcB/Dem  recombinant  congenic  mice 
[30].  Those  experiments  demonstrated  that  the  calculated  tissue 
strength  of  bone  in  strains  HcB/8  and  HcB/23  differ  greatly,  in 
spite  of  similar  mineral  content  (ash  percentage).  Here,  charac¬ 
terization  of  these  strains’  is  extended  to  include  FTIR  imaging 
data  from  cortical  bone  samples,  since  three-point  bending  is  a 
cortical  bone  test.  We  found  that  the  strains  differed  markedly 
in  the  ratio  of  nonreducible  pyridinoline  (pyr)  to  reducible  dehy- 
drodihydroxynorleucine  (de-DHLNL)  collagen  cross-links 
(XLR)  and  bone  mineral  (apatite)  crystal  size  and  perfection 
(crystallinity).  We  also  found  that  these  two  indices  were  highly 
correlated.  No  statistically  significant  differences  were  noted  in 
the  one-dimensional  spatial  correlation  of  either  XLR  or  crys¬ 
tallinity  parallel  to  the  bone  surface.  These  data  are  considered 
with  regard  tQ  the  inechanistic  basis  of  bone  tissue  strength. 

MATERIALS  and  methods 
Mice 

The  HcB/Dem  strains  were  established  and  are  maintained  at 
the  Netherlands  Cancer  Institute.  Briefly,  the  HcB/Dem  mice  are 
inbred  strains  derived  from  arbitrary  pairs  of  N3  backcross  ani¬ 
mals  [33-35].  Each  HcB/Dem  strain  contains  a  random  12.5% 
of  the  C57BL/10ScSnA  genome  and  87.5%  of  the  C3H/DiSnA 
genome.  HcB/8  and  HcB/23  received  different,  slightly  overlap¬ 
ping  portions  of  the  C57BL/10ScSnA  genome;  hence  they  dif¬ 
fer  in  approximately  25%  of  the  genome.  The  HcB/Dem  strains 
have  been  genotyped  at  130  marker  loci  distributed  over  each 
of  the  autosomes  [36,  37].  Because  they  are  inbred,  individuals 
from  a  single  strain  have  the  same  genetic  composition,  save 
for  new  mutations  and  residual  unfixed  chromosome  segments 
[38,  39]. 

Less  than  5%  of  the  genome  was  unfixed  at  the  time  of  geno- 
typing;  residual  heterozygosity  is  expected  to  be  reduced  by  half 
in  each  subsequent  generation  of  inbreeding.  The  mice  described 
in  this  report  were  maintained  at  the  Hospital  for  Special  Surgery 
until  6  to  7  months  of  age  under  12-hr  light-dark  cycling  and  fed 
irradiated  PICO  5058  rodent  chow  and  autoclaved  tap  water  ad 
lib.  Only  females  were  studied,  because  inclusion  of  males  would 
have  introduced  sex-dependent  variability  in  the  traits  in  addi¬ 
tion  to  the  strain-specific  and  environmental  variability  already 


encountered.  At  sacrifice,  body  mass  and  rostroanal  length  were 
measured.  This  work  satisfied  The  Hospital  for  Special  Surgery’s 
requirements  for  the  ethical  use  of  laboratory  research  animals. 

Ash  Percentage 

Bone  mineral  fraction  was  calculated  by  comparison  of  dry, 
defatted  bone  weight  to  ash  weight  of  homogenized  tissue 
[40,  41].  We  used  entire  bones  rather  than  bones  from  which 
the  epiphyses  and  marrow  have  been  removed  because  the  for¬ 
mer  technique  is  more  reproducible  in  our  hands. 

Radiographic  Analysis 

Image  analysis  of  fine  focus  contact  radiographs  of  dissected 
humeri  was  performed  as  described  [41]  and  used  to  calculate 
cross-sectional  area  (CSA)  and  cross-sectional  moment  of  in¬ 
ertia  (/).  Humeral  length  was  defined  as  the  distance  along  the 
diaphysis  from  the  trochlea  to  the  humeral  head’s  most  distant 
point.  Outer  and  inner  diameters  were  measured  in  orthogonal 
projections  just  distal  to  the  deltoid  tuberosity,  perpendicular  to 
the  diaphyseal  axis.  CSA  was  calculated  according  to  the  ellip¬ 
tical  approximation: 

CSA  =  jr(MLOR*APOR-MRIR*APIR)  (1) 

where  OR  is  the  outer  and  IR  the  inner  radius  in  either  the 
mediolateral  (ML)  or  anteroposterior  (AP)  projection.  I  also 
was  calculated  according  to  the  elliptical  approximation  [8]: 

/  =  rr/4[(ML  OR)3(AP  OR)  -  (ML  IR)3(AP  IR)]  (2) 

Image  analysis  was  performed  with  SigmaScan  (Jandel 
Scientific)  image  analysis  software.  All  radiographs  included 
stepped  aluminum  densitometric  phantoms.  Radiographic  im¬ 
ages  were  digitized  with  a  Kodak  digital  camera,  with  the  pho¬ 
tographic  field  including  a  length  scale. 

Biomechanical  Testing 

Quasistatic  three  point  bend  testing  was  performed  on  left 
humeri  using  posts  designed  and  machined  in-house  with  the 
MTS  apparatus  and  Instron  electronics  as  described  [42,  43]. 
Humeri  were  oriented  with  the  deltoid  tuberosity  downward  and 
the  specimens  oriented  with  the  central  post  adjacent  to  the  distal 
end  of  the  deltoid  tuberosity.  This  orientation  corresponds  to  the 
ML  axis  being  parallel  to  the  applied  force.  Posts  were  separated 
by  3.75  mm. 

Biomechanical  data  were  analyzed  following  several  impor¬ 
tant  assumptions.  First,  we  assumed  that  bone  strength  is  de¬ 
termined  entirely  by  the  cortical  bone  in  the  mid-diaphysis. 
Second,  we  assumed  that  the  humeral  diaphysis  is  an  ellipse 
with  its  major  axis  lying  in  the  ML  plane  and  its  minor  axis 
lying  in  the  AP  axis.  Calculated  biomechanical  parameters  were 
obtained  according  to  the  following  standard  formulas  for  three- 
point  bending  of  ellipses  [8]: 

Stress  (cr),  (MPa)  =  FLc/47,  with  F  =  force,  L  =  length,  c  = 

ML  OR 
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Strain  (e),  (mm/mm)  =  12cd/L2,  with  c  —  ML  OR,  d  =  dis¬ 
placement,  L  =  length 

Young’s  Modulus  (E),  (MPa)  =  (F/d)(L3/487) 

FTIR  Imaging 

The  left  tibia  from  one  mouse  of  each  strain  was  fixed  in 
buffered  formalin,  dehydrated  in  ethanol,  and  embedded  in 
PMMA,  from  which  3  fim  midsagittal  sections  were  obtained 
for  FTIR  imaging.  FTIR  imaging  was  performed  using  a  Bio- 
Rad  Stingray  FTIR  imaging  system  equipped  with  a  64  x  64 
mercury-cadmium-telluride  detector,  under  continuous  nitrogen 
purge.  In  this  configuration,  each  field  of  analysis  corresponds  to 
a  400  x  400  pm,  and  each  pixel  to  ~6.3  x  6.3  pm  area.  Water 
vapor  and  PMMA  spectral  contributions  were  accounted  for,  and 
mineral  crystallinity  and  XLR  calcualted  as  described  elsewhere 
[31,  32, 44, 45].  Consequently,  in  each  analyzed  field,  pixels  in 
which  either  the  1660:1690  cm-1  ratio  fell  outside  of  the  0-2.5 
range  or  the  1030:1020  cm-1  ratio  fell  outside  the  0-2.0  range 
were  censored  as  bad  data. 

Statistical  Analysis 

Most  statistical  analyses  were  performed  using  SigmaStat 
2.03  (Jandel  Scientific)  and  the  remainder  was  performed  with 
Origin  version  5.0  (Microcal).  Student’s  r-test  compared  HcB/8 
and  HcB/23  mice  for  anatomic  features  and  biomechanical  per¬ 
formance,  with  values  for  failure  stress  log-transformed  prior 
to  analysis  to  satisfy  the  test  assumption  of  normality.  Signifi¬ 
cance  levels  were  Bonferroni-corrected  to  account  for  multiple 
coifipfarisops,  Iftterstrain  comparisons  of  XLR  and  crystallinity 
indices  were  Tone  by  x  2  contingency  table,  taking  the  weighted 
'  averagd  number  of  pixels  in  each  0.25  unit  interval  of  the  distri- 
•bution  for  each'strain.'The  Kolmogorov-Smimov  test  was  used 
to  assess  normality  of  the  FTIR  imaging  data.  We  used  all  bone 
spectra  to  assess  the  correlation  of  the  XLR  and  crystallinity  us¬ 
ing  the  Spearman  rank  order  method.  One-dimensional  spatial 
correlation  functions  (lDCs)  g(r)  were  calculated  for  vectors  r 
parallel  to  the  bone  surface  according  to  the  formula 

-  E.  fe  ~  x)(xi+r  -  x) 

a/EA  -  ^VEA+r  -x) 

where  x,-  is  the  index  at  a  given  pixel  i;x,+?  is  the  index  at  a 
pixel  located  at  a  pixel  r  pixels  from  /;  and  x  is  the  average 
value  of  the  index  over  valid  pixels  in  the  entire  field.  The  sum 
in  the  numerator  includes  overall  values  of  i  for  which  both  x,- 
and  Xj+r  are  valid  pixels  [46].  This  correlation  function,  which 
quantifies  the  degree  to  which  the  values  at  pixels  separated  by 
r  are  correlated,  is  commonly  used  in  statistical  mechanics.  The 
correlation  function  g(r)  is  unity  at  r  =  0;  in  our  samples  it  is 
found  to  decay  linearly  with  r  for  small  r,  and  we  define  the 
correlation  length  as  the  inverse  of  this  slope.  We  hypothesized 
that  strain  and  index  tested  (XLR  or  crystallinity)  would  each 
influence  the  correlation  length.  To  test  the  hypothesis,  a  straight- 
line  fit  was  made  over  the  linear,  small  r  region  of  g(r).  The 
slopes  were  analyzed  by  two-factor  ANOVA. 


RESULTS 

Calculated  Tissue  Strength  Differs  Between  HcB/8 
and  HcB/23 

We  examined  ten  6-month-old  female  HcB/8  and  nine  6- 
month-old  female  HcB/23  mice.  The  anatomic  and  biomechan¬ 
ical  properties  of  the  animals  are  summarized  in  Table  1.  These 
data  show  that  while  humeral  failure  load  and  structural  stiff¬ 
ness  is  similar  in  the  two  strains,  HcB/23  achieves  this  strength 
through  more  favorable  bone  architecture  and  less  favorable 
bone  tissue  strength,  whereas  HcB/8  does  so  through  less  favor¬ 
able  bone  architecture  and  more  favorable  bone  tissue  strength. 
The  inferiority  of  HcB/23  bone  tissue  strength  is  not  due  to  lesser 
mineralization,  as  this  strain  has  a  greater  ash  percentage  than 
HcB/8.  These  observations  prompted  us  to  use  FTIR  imaging 
to  explore  the  possible  basis  for  file  marked  difference  in  bone 
tissue  strength  between  the  strains,  examining  four  400-jum2 
sections  of  bone  from  each  strain. 

XLR  Differs  Between  HcB/8  and  HcB/23 

The  amide  I  and  II  absorption  bands,  which  in  bone  are  rep¬ 
resentative  principally  of  collagen,  are  broad  and  consist  of 
underlying  peaks,  each  corresponding  to  vibrations  in  a  spe¬ 
cific  chemical  environment.  Of  these  underlying  bands,  we  have 
shown  previously  that  the  ratio  of  absorbances  at  1660  cm-1  and 
1690  cm-1  is  representative  of  the  ratio  of  pyridinoline  to  deH- 
DHLNL,  two  of  the  major  cross-links  present  in  bone  collagen 


TABLE  1 

Anatomic  and  biomechanical  properties  of  HcB/8 
and  HcB/23  mice. 


Trait 

HcB/8 
(n  =  10) 

HcB/23 
>  =  9) 

p  value 

Body  mass  (g) 

19.9  ±1.5 

23.5  ±1.1 

I 

o 

1-H 

V 

BMI  (g/cm2) 

0.27  ±  0.01 

0.25  ±  0.01 

0.001 

Humeral  length 

11.1  ±0.3 

11.3  ±0.2 

NS 

(mm) 

ML  OD  (mm) 

1.07  ±  0.07 

1.23  ±0.04 

<10-5 

ML  ID  (mm) 

0.59  ±  0.07 

0.59  ±  0.05 

NS 

AP  OD  (mm) 

0.81  ±  0.05 

0.93  ±  0.07 

<0.001 

AP  ID  (mm) 

0.35  ±  0.04 

0.48  ±  0.08 

<0.001 

CSA  (mm2) 

0.49  ±  0.06 

0.67  ±  0.05 

<10~5 

I  (mm4) 

0.041  ±  0.009 

0.079  ±0.011 

<10~6 

Ash  percentage 

68.6  ±  1.0 

70.2  ±1.5 

NS 

Failure  load  (N) 

7.9  ±  1.4 

8.1  ±0.9 

NS 

Structural  stiffness 

22  ±7 

23  ±5 

NS 

(N/mm) 

Failure  stress  (MPa) 

192  ±  36 

118  ±13 

<10~4 

Young’s  modulus 

4800  ± 1570 

2610  ±490 

0.002 

(MPa) 

BMI  =  body  mass,  ML  =  mediolateral,  OD  =  outer  diameter,  ID  = 
inner  diameter,  AP  =  anteroposterior,  1  =  cross-sectional  moment  of 
inertia,  N  =  newtons,  MPa  =  megapascals.  The  Bonferroni-corrected 
significance  level  is  0.004.  v 
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[44].  Representative  sections  from  each  strain  are  shown  in 
Figure  1.  Overall,  the  difference  in  XLR  is  highly  significant 
(X2  =  1968,  9  degrees  of  freedom,  p  <  10'15).  The  mean  and 
median  HcB/23  XLR  indices  are  1.27  and  1.37,  respectively. 
For  HcB/8,  the  mean  XLR  index  is  1.65  and  the  median  is 
1.64.  Neither  of  the  distributions  is  normal,  with  p  <  10-15  for 
the  Kolmogorov-Smimov  test  of  normality  in  both  cases.  The 
HcB/23  distribution  is  more  negatively  skewed  than  the  HcB/8 
distribution,  with  values  of  —1.727  and  —0.185,  respectively. 
The  HcB/23  distribution  has  a  kurtosis  of  3.012  and  the  HcB/8 
distribution  has  a  kurtosis  of  1.198.  Thus,  in  addition  to  having 
different  XLRs,  the  distributions  are  shaped  differently. 

Crystallinity  Differs  Between  HcB/8  and  HcB/23 

The  ratio  of  absorbances  at  1030  cm-1  and  1020  cm-1  is  a 
composite  index  of  the  mineral  (apatite)  crystal  size  and  perfec¬ 
tion,  with  higher  ratios  indicating  greater  mineral  crystallinity. 
The  crystallinity  maps  of  the  two  strains  are  compared  in 
Figure  2.  As  for  XLR,  the  difference  between  the  strains  is  highly 
significant  x2  =  400,  6  degrees  of  freedom,  p  <  10~15).  Also 
as  for  XLR,  the  crystallinity  distributions  deviate  markedly  from 
normality  ( p  <  10~1S  in  both  cases).  The  mean  and  median  of 
the  crystallinity  index  in  HcB/23  are  0. 84  and  0.9 1 ,  respectively. 
For  HcB/8,  the  corresponding  values  are  0.95  and  0.97.  As  for 
the  XLR  distributions,  the'HcB/23  crystallinity  distribution  is 
more  negatively  skewed  (—1,408  and  —0.913)  and  more  kur- 
totic  (1.855  and  0.882)  than  the  HcB/8  distribution.  However, 

,  the'crystallinity  distributions  are  less  dissimilar  than  the  XLR 
distributions..  ’  •  . 

;  Tv>.  .  . 

XLRand  Crystallinity  Highly  Correlated 

Because  of  the  well-established  relationship  between  the  col¬ 
lagen  and  mineral  components  of  bone’s  extracellular  matrix, 
we  examined  the  pixel-by-pixel  correlation  of  the  XLR  and  the 
crystallinity  index.  These  properties  were  highly  and  similarly 
correlated  in  both  strains,  with  correlation  coefficients  of  0.751 
for  HcB/8  and  0.758  for  HcB/23.  In  both  strains,  the  results  are 
highly  significant  ( p  <  10~16). 

XLR  and  Crystallinity  Correlation  Lengths  Similar 
for  HcB/23  and  HcB/8 

The  FTIR  imaging  data  summarized  above  consider  pix¬ 
els  collectively,  without  regard  for  their  spatial  arrangement 
within  the  cortical  bone.  To  explore  whether  the  similarity  of 
closely  spaced  pixels  differs  between  strains,  we  investigated 
one-dimensional  spatial  correlations  in  each  of  the  FTIR  sam¬ 
ples.  Figure  3  shows  the  relationship  between  distance  parallel  to 
the  bone  surface  in  pixels  and  g(r),  the  correlation  of  the  XLR  at 
2  points  separated  by  distance  r .  Inspection  of  the  figure  shows 
that  for  pixel  separations  up  to  approximately  10  pixels,  corre¬ 
sponding  to  a  distance  of  70  pm,  the  decay  of  g(r)  is  linear.  For 
larger  r,  g(r)  varies  erratically;  this  is  not  surprising  because 
the  number  of  pixels  summed  over  becomes  small  at  large  r, 
leading  to  large  statistical  uncertainties.  Fitting  g(r)  to  the  form 


TABLE  2 

Estimates  of  A,  decay  of  correlation  over  distance. 


HcB/8 

HcB/23 

XLR 

-0.027  ±  0.010 

-0.029  ±  0.018 

Crystallinity 

-0.029  ±  0.017 

-0.032  ±  0.018 

XLR  =  1660:1690  absorbance  ratio;  crystallinity  = 
1030:1020  absorbance  ratio. 


g(r)  —  1  -  Ar  forr  <  10  yielded  estimates  of  A  as  summarized 
in  Table  2.  Two-factor  ANOVA  showed  no  evidence  that  strain, 
index,  or  their  interaction  affected  A. 

DISCUSSION 

In  our  initial  survey  of  bone  properties  of  the  HcB/Dem  re¬ 
combinant  congenic  strains,  we  noted  that  HcB/8  and  HcB/23 
are  similar  with  respect  to  failure  load  and  ash  percentage,  but 
that  HcB/23  has  much  inferior  calculated  tissue  strength  relative 
to  HcB/8.  While  this  difference  is  not  as  great  as  that  between  the 
parental  strains  C3H/DiSnA  and  C57BL/10ScSnA,  the  parental 
strains  differ  significantly  in  ash  percentage  as  well  [30].  These 
observations  led  us  to  explore  the  possibility  that  material  prop¬ 
erties  amenable  to  investigation  by  FTIR  imaging  might  provide 
insight  into  the  dramatic  difference  in  failure  stress  between  the 
strains. 

XLR  is  one  such  property.  Following  collagen  synthesis  and 
secretion,  individual  collagen  monomers  are  enzymatically  and 
nonenzymatically  cross-linked  in  the  extracellular  space.  In  bone, 
the  predominant  covalent  cross-links  between  type  I  collagen 
molecules  are  formed  by  condensation  of  hydroxylysine  and,  to 
a  lesser  extent,  lysine  residues.  Initial  cross-links  are  divalent  and 
reducible,  and  then  mature  into  the  trivalent  nonreducible  pyridi- 
noline,  deoxy-pyridinoliene,  and/orpyrrole  depending  on  the  ex¬ 
tent  of  hydroxylation  of  the  lysine  residues  involved  [see  47  for 
review].  We  previously  showed  that  the  1660  cm"1: 1690  cm-1 
absorbance  ratio  reflects  the  relative  amounts  of  two  of  the 
major  collagen  cross-links  present  in  bone,  pyridinoline  (sodium 
borohydrate  nonreducible,  trivalent)  and  deH-DBLNL  (sodium 
borohydrate  reducible,  divalent)  [44],  Since  thin  tissue  sections 
were  employed,  ratios  rather  than  absolute  amounts  were  re¬ 
ported  to  minimize  potential  artifacts  due  to  variations  in  sec¬ 
tion  thickness.  Cross-links  are  tissue-  rather  than  collagen  type- 
specific  and  confer  tensile  strength  and  viscoelasticity  on  bone 
matrix,  properties  that  improve  fracture  resistance  in  bending 
[47,48], 

Apatite  crystallinity  is  a  second  such  property.  The  mineral  in 
bone  is  a  poorly  cry  stalline  analogue  of  the  natural  occurring  ge¬ 
ologic  mineral,  hydroxyapatite.  In  bone,  the  apatite  is  hydroxide- 
deficient,  nonstoichiometric,  and  carbonate-enriched  [49,  50]. 
Apatite  crystallinity,  as  reflected  by  the  1030  cm_1:1020  cm-1 
absorbance  ratio,  reflects  the  size  and  perfection  of  the  crystals 
[51].  The  mineral  component  of  bone  matrix  is  poorly  crystalline 
and  highly  substituted,  a  property  that  facilitates  exchange  of 
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Figure  3.  Two-point  correlation  function  g(r)  versus  separation  r  (in  pixels)  for  r  directed  parallel  to  the  bone  surface.  The  graph  shows  one  of  five  samples 
analyzed  for  the  each  strain;  these  results  are  typical.  HcB/8  =  +  and  HcB/23  =  x .  The  dotted  lines  are  linear  fits  (g(r)  =  1-Ar)  to  the  initial,  linear  portions  of 
the  correlation  functions  (r  <  10). 


bone  and  blood  calcium  and  phosphate  levels  and  thus  con¬ 
tributes  to  bone’s  metabolic  function  as  a  reservoir  for  calcium. 
Our  data  suggest  that  increased  crystallinity  improves  mechani¬ 
cal  performance.  That  distinct  crystallinity  optima  for  mechan¬ 
ical  performance  and  mineral  homeostasis  exist  is  a  hypothesis 
that  warrants  further  study. 

We  report  a  high  point-by-point  correlation  between  XLR 
and  crystallinity  indices.  It  is  well  established  that  bone  miner¬ 
alization  depends  on  and  follows  organic  matrix  synthesis  tem¬ 
porally.  In  osteogenesis  imperfecta,  for  example,  defective  type 
I  collagen  is  the  underlying  problem,  but  bone  mineral  content 
generally  is  low  as  well.  Studies  examining  degree  of  mineraliza¬ 
tion  by  either  backscattered  electron  microscopy  [52, 53]  or  mi¬ 
croradiography  [54, 55]  demonstrate  that  mineralization  occurs 
in  two  phases:  an  initial,  relatively  rapid  initial  phase  followed 
by  a  subsequent  slower  continued  mineral  accretion.  These  ear¬ 
lier  studies  relate  degree  of  mineralization  to  matrix  maturation, 
whereas  the  data  presented  here  suggest  that  crystallinity  also  is 
related  to  matrix  maturation.  It  is  tempting  to  speculate  that  these 


.observations  are  in  fact  illustrations  of  a  more  general,  possibly 
mechanistic,  relationship  between  XLR  and  crystallinity. 

There  are  important  limitations  to  the  work  presented  here. 
First,  the  mechanical  test  performed,  three-point  bending,  mea¬ 
sures  whole  bone  strength,  not  tissue  strength.  Tissue  mod¬ 
ulus  can  be  measured  with  comparable  spatial  resolution  by 
nanoindentation.  Tissue  strength  is  calculated  from  the  struc¬ 
tural  strength,  bone  dimensions  measured  from  orthogonal  plain 
radiographs,  and  a  relatively  crude  simplifying  assumption  re¬ 
garding  the  shape  of  the  murine  humeral  diaphysis.  These  as¬ 
sumptions  lead  to  important  uncertainties  regarding  the  failure 
stress  and  modulus  estimates  reported  here.  These  technical  lim¬ 
itations  apply  equally  to  the  mechanical  testing  of  both  strains 
and  should  therefore  not  affect  the  magnitude  of  the  difference 
between  the  calculated  failure  stress  and  modulus  between  them. 
Moreover,  three-point  bend  testing  is  a  contrived  experimen¬ 
tal  fracture  model  and  may  differ  mechanistically  from  clinical 
fracture.  Of  the  whole  bone  tests,  three-point  bending  offers  sev¬ 
eral  advantages  over  either  four-point  bending  qr  torsion  testing. 
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Most  important  is  that  in  three-point  bending  the  fracture  occurs 
at  a  predetermined  position  in  the  bone,  rather  than  at  the  bone’s 
weakest  point,  wherever  that  may  lie  along  the  bone’s  length. 
Since  bone  diameters  vary  markedly  along  the  bone’s  long  axis, 
this  eliminates  an  important  potential  source  of  error  in  calculat¬ 
ing  tissue  strength.  While  nanoindentation  theoretically  allows 
direct  measurement  of  tissue  strength,  it  is  difficult  to  perform 
properly,  especially  when  working  with  bones  as  small  as  those 
used  in  these  experiments. 

The  second  limitation  is  that  the  FT®  imaging  was  restricted 
to  single  animals  of  each  strain.  While  the  number  of  individual 
spectra  analyzed  is  large,  interpretation  must  be  cautious  as  the 
animals  tested  might  not  be  representative  of  their  respective 
strains.  Third,  we  used  tibiae  for  FT®  imaging,  but  humeri  for 
mechanical  testing.  The  degree  to  which  this  difference  in  sites 
affects  the  validity  of  our  findings  is  an  open  question.  Both  the 
failure  stress  and  the  FT®  results  reflect  the  cortical  bone  tis¬ 
sue,  and  it  is  generally  unknown  how  much  intersite  variability 
exists  in  mammalian’ cortical  bone.  The  known  differences  in 
humeral  and  tibial  whole  bone  strength  can  be  attributed  to  dif¬ 
ferences  in  their  dimensions.  However,  more  subtle  differences 
in  the  tissue-level  properties  may  exist  and  this  study  provides 
no  insight  regarding  these.  Fourth,  we  examined  formalin-fixed 
specimens,  whereas  subsequent  work  has  demonstrated  that  this 
treatment  tends  to  increase  XLR  and  decrease  the  crystallinity 
index  relative  to  unfixed  specimens  [56].  As  for  the  technical 
limitations  of  the  mechanical  testing,  this  constraint  should  not 
affect  the  difference  between  the  strains.  •  ' 

‘  These  limitations  notwithstanding,  the  data  presented  here 
ptoVide  et»idijnce that  differences  in  XLR  and  crystallinity  reflect 
:  differences  in  biomechanical  performance  not  only  in  the  pres¬ 
ence  of- significant  pathology  [57-60],  but  also  among  strains 
of  wild-type  mice.  Turner  and  colleagues  [29]  also  found  evi¬ 
dence  suggesting  marked  differences  in  tissue  strength  among 
the  BXH  recombinant  inbred  strains  but  have  not  yet  provided 
a  mechanistic  hypothesis  regarding  the  origin  of  these  differ¬ 
ences.  A  fuller  understanding  of  the  role  of  these  and  other  non 
bone  mineral  content  components  of  bone  tissue  strength  will 
no  doubt  help  explain  the  apparent  disparities  between  fracture 
reduction  and  changes  in  bone  mineral  content  in  human  clinical 
trials  as  well. 
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Abstract 

Many  densitometric  studies  in  mice  assess  bone  mineral  density  (BMD)  at  specified  regions  of  interest,  often  using 
ex  vivo  specimens.  In  the  present  study,  we  sought  to  determine  the  precision  and  accuracy  of  ex  vivo  densitometry 
of  mouse  bones,  comparing  two  software  versions  and  two  data  acquisition  techniques.  The  newer  software  allows 
manual  adjustment  of  the  threshold  value  for  bone,  improving  the  ability  to  analyze  bone  edges  correctly.  Root  mean 
square  standard  deviations  were  2-3  mg/cm2,  with  coefficients  of  variation  ranging  between  3%  and  5%  for  femora 
and  humeri  and  between  6%  and  1%  for  radii.  The  regression  coefficients  for  bone  mineral  content  as  a  function  of 
ash  mass  were  near  1  for  femora  and  humeri,  but  considerably  lower  for  radii.  Coefficients  of  determination  were 
inversely  related  to  bone  size,  with  R2  values  exceeding  0.9  at  the  femur,  0.8  at  the  humerus,  and  ranging  between  0.3 
and  0.6  at  the  radius.  We  found  that  our  instrument  has  a  position  artifact,  with  BMD  and  bone  mineral  content  depen¬ 
dent  on  the  specimen’s  coordinates  in  the  scanned  field.  Our  findings  establish  the  limitations  of  ex  vivo  densitome¬ 
try  with  the  PlXImus  and  support  our  recommendation  that  investigators  seek  position  artifacts  in  their  instruments. 

Key  Words:  Bone  mineral  density  (BMD);  bone  mineral  content  (BMC);  dual  energy  X-ray  absorptiometry 
(DXA);  mouse;  accuracy;  precision. 


Introduction 

Dual-energy  X-ray  absorptiometry  (DXA)  is  the  most 
widely  used  imaging  methodology  to  assess  risk  of  fragility 
fracture.  In  addition  to  its  documented  ability  to  stratify 
patients  for  fracture  risk,  examination  is  accurate,  precise, 
noninvasive,  and  entails  only  modest  radiation  exposure. 
Moreover,  examination  is  feasible  at  the  lumbar  spine,  hip,  and 
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wrist,  corresponding  to  common  fracture  sites.  These  features 
make  DXA  an  attractive  choice  for  diagnosing  osteoporosis, 
following  patients  to  monitor  effectiveness  of  therapy  for 
osteoporosis,  and  as  a  surrogate  end  point  for  fracture  in  clin¬ 
ical  investigation.  Correspondingly,  DXA  has  become  a  popu¬ 
lar  analytical  technique  in  mice  and  other  small  animals,  either 
as  a  substitute  for  more  demanding  techniques,  such  as  bio¬ 
mechanical  testing,  histomorphometry,  and  gravimetry,  or  as 
an  adjunct  to  them. 

Recent  progress  in  skeletal  genetics  and  success  in  contin¬ 
uing  efforts  to  develop  bone  pharmaceuticals  have  been 
achieved  in  large  part  through  studies  performed  in  rodents. 
As  small-animal  skeletal  research  continues,  more  detailed 
questions  regarding  differences  in  the  behavior  at  specific 
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anatomical  sites  are  becoming  ever  more  prominent.  Several 
laboratories  explicitly  study  skeletal  phenotypes  at  specific 
sites  (e.g.,  refs.  1-5).  Interventional  studies  also  focus  on  site- 
specific  end  points  in  small-animal  models  (e.g.,  refs.  6  and 
7).  Such  experiments  require  that  site-specific  data  be  gener¬ 
ated  and  interpreted. 

As  in  humans,  site-specificity  of  bone  mineral  density 
(BMD)  has  been  reported  in  mice  (1,3,8, 9).  One  commonly 
used  small-animal  densitometer  was  designed  primarily  for 
determination  of  whole-body  BMD.  Although  smaller  regions 
of  interest  can  be  defined  and  analyzed,  few  normative  data  are 
available  for  interpreting  region  of  interest  data  (10-13). 
Moreover,  the  accuracy  and  precision  of  ex  vivo  bone  densito¬ 
metry  has  received  limited  attention.  In  order  to  address  these 
gaps  in  our  knowledge,  we  undertook  a  comparison  of  excised 
mouse  femora,  humeri,  and  radii  using  DXA.  We  compared 
results  of  femoral,  humeral,  and  radial  densitometry  per¬ 
formed  using  two  different  scanning  protocols  and  two 
versions  of  the  analysis  software,  comparing  densitometry  to 
gravimetric  methods. 

We  report  that  there  is  little  difference  in  accuracy  and  pre¬ 
cision  between  scans  performed  in  air  and  scans  performed  in 
water.  The  newer  software  is  better  able  to  analyze  the  edges 
of  radii,  the  smallest  and  least  dense  bones  we  examined.  We 
found  evidence  of  systematic  position  dependency  of  the  den¬ 
sitometry  data  in  our  instrument.  Our  data  suggest  that 
although  mouse  femora  and  humeri  can  fruitfully  be  studied 
by  ex  vivo  densitometry,  data  obtained  from  radii  should  be 
interpreted  cautiously.  Further,  our  finding  of  a  position  arti¬ 
fact  in  our  instrument  leads  us  to  recommend  that  all  investi¬ 
gators  perform  precision  studies  that  allow  such  an  effect  to 
be  sought. 

Materials  and  Methods 

Mice 

Mice  used  in  this  study  were  C57BL/6J  male  and  female 
retired  breeders  from  one  of  several  IACUC-approved  proto¬ 
cols.  Animals  were  housed  two  to  five  animals  in  a  500-cm2 
cage  on  corncob  bedding,  fed  laboratory  rodent  chow  5001 
(PMI  Nutrition  International,  Richmond,  IN)  and  tap  water  ad 
libitum,  and  exposed  to  a  12-h  light/12-h  dark  cycle.  Animals 
were  sacrificed  at  various  ages  by  CO2  asphyxiation.  Femora, 
humeri,  and  radii  were  dissected  free  of  soft  tissue  immedi¬ 
ately  following  sacrifice,  wrapped  in  phosphate-buffered 
saline-saturated  gauze,  and  stored  at  -70°C  until  analysis. 
One  radius  was  broken  during  dissection  and  was  not  avail¬ 
able  for  study. 

DXA  Scanning 

Dissected  bones  were  scanned  by  DXA  using  the  PIXImus 
(GE  Lunar,  Madison,  WI)  densitometer  using  software  version 
1.45.023.  Analysis  was  performed  using  either  software  ver¬ 
sion  1.45.023  (hereafter  shortened  to  1.45)  or  software  version 
2.10.013  (hereafter  shortened  to  2.10).  We  scanned  a  manu¬ 
facturer-provided  phantom  daily  during  data  collection  to  con¬ 


firm  the  instrument’s  calibration.  Bones  were  allowed  to  equi¬ 
librate  to  room  temperature  prior  to  scanning.  For  air  scanning, 
bones  from  two  mice  were  positioned  manually  in  air  on  a 
Plexiglas  platform  provided  by  the  instrument  manufacturer  to 
achieve  similar  orientations  for  right  and  left  bones.  For  water 
scanning,  bones  from  two  mice  were  positioned  manually  in  a 
2-mm-thick  polyethylene  dish  under  6  mm  of  water,  and  the 
dish  was  placed  on  top  of  the  Plexiglas  platform.  Both  air  and 
water  scans  were  repeated  after  rotating  the  platform  180°.  All 
scans  included  a  pair  of  wires  placed  at  one  comer.  Acquisition 
time  per  scan  was  approx  5  min.  We  manually  adjusted  regions 
of  interest  to  analyze  each  of  the  bones  individually.  Software 
version  1.45  did  not  allow  adjustment  of  thresholds,  so  we 
excluded  any  bones  with  edges  that  were  incorrectly  identified 
by  that  software  version. 

Gravimetric  Methods 

Individual  bone  volumes  were  calculated  according  to 
Archimedes’  principle  following  immersion  in  water  using  a 
Mettler  Toledo  AE240  balance  and  a  Mettler  Toledo  33360 
density  measurement  kit.  Bones  were  heated  overnight  at 
105°C  to  determine  dry  mass.  Bones  were  then  incinerated  at 
600°C  and  ash  mass  determined.  (Ash  percentage  =  ash 
mass/dry  mass  and  volumetric  bone  mineral  density  [vBMD] 
=  ash  mass/bone  volume.) 

Statistics 

Precision  of  DXA  bone  mineral  density  (BMD)  was  calcu¬ 
lated  using  Microsoft  Excel  and  the  International  Society  for 
Clinical  Densitometry’s  precision  calculation  tool  (www.iscd. 
org/links/calc.cfm),  considering  each  bone  as  a  “patient.”  Both 
root  mean  square  standard  deviation  and  percent  coefficient  of 
variation  (%CV)  are  reported.  Additional  statistical  calculations 
were  performed  using  SigmaStat  for  Windows,  version  2.03 
(SPSS,  Chicago,  IL).  The  significance  level  was  p  <  0.05  and 
adjusted  by  Bonferroni’s  correction  to  account  for  multiple 
comparisons.  Data  are  reported  as  mean  ±  standard  deviation 
unless  noted  otherwise.  We  constructed  Bland-Altman  plots 
(14)  for  each  bone  and  each  densitometry  technique. 

Results 

Software  Version  2.10.  Improves  Ability  to  Analyze 
the  Edges  of  Small  Bones 

We  performed  duplicate  densitometry  scans  of  ex  vivo 
mouse  bones,  scanning  the  femora,  humeri,  and  radii  of  20 
animals.  Scans  were  performed  in  air  and  in  water  to  compare 
these  techniques  for  accuracy  and  precision,  with  the  speci¬ 
mens  rotated  1 80°  as  a  group  between  scans.  Scans  were  per¬ 
formed  using  software  version  1 .45  but  were  analyzed  using 
both  software  versions.  Version  1 .45  does  not  allow  the  user 
to  adjust  the  threshold  value,  whereas  version  2.10  allows  this 
to  be  altered  from  the  default  value  of  1306.  Using  version 
1.45  or  the  default  threshold  in  version  2.10,  7/39  radii  stud¬ 
ied  were  misanalyzed  on  one  or  both  scans  (Fig.  1A). 
Adjusting  the  threshold  to  1292  corrected  analysis  of  the 
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Fig.  1.  Effects  of  threshold  adjustment.  (A)  shows  that 
edges  of  a  radius  (bottom  row,  middle  of  field)  are  misana- 
lyzed  at  the  default  threshold  of  1306.  In  (B),  the  threshold  of 
the  same  scan  has  been  readjusted  to  allow  the  bone  edges  to 
be  analyzed  correctly.  In  (C),  another  scan  is  shown,  demon¬ 
strating  the  high  background  that  occasionally  results  from 
threshold  adjustment.  The  same  adjusted  threshold  value  of 
1292  was  used  in  (B)  and  (C).  Each  scan  includes  a  pair  of 
wires  in  the  lower  left  comer  used  for  orienting  scans.  Color 
image  available  for  viewing  at  www.humanapress.  com. 


Table  1 


Precision  by  Bone,  Software,  and  Technique 


Femur 

Humerus 

Radius 

1.45,  air 

0.003 

0.002 

Undefined 

4.36% 

4.13% 

1 .45,  water 

0.002 

0.002 

Undefined 

3.54% 

4.50% 

2.10,  air 

0.003 

0.002 

0.002 

4.30% 

3.69% 

6.77% 

2. 1 0,  water 

0.002 

0.002 

0.002 

3.39% 

3.72% 

6.05% 

Note:  Precision  is  shown  as  the  root  mean  square  standard  deviation 
(in  g/cm2)  on  the  first  line  of  each  cell  and  as  %CV  on  the  second  line. 


radial  edges  (Fig.  IB)  in  all  but  two  cases,  but  at  the  cost  of 
increasing  nonspecific  background  (Fig.  1C).  The  increased 
background  precluded  analysis  of  one  pair  of  water  scans 
because  suitable  regions  of  interest  could  not  be  defined. 
Further  analyses  with  software  version  2.10  were  performed 
following  threshold  adjustment  to  1292. 

Accuracy,  But  Not  Precision,  Depends  on  Bone  Size 

We  performed  paired  scans  of  excised  bones  in  air  and  sub¬ 
merged  under  6  mm  of  distilled  water.  The  scans  were  ana¬ 
lyzed  using  each  software  version,  with  version  2.10  analysis 
following  adjustment  of  the  threshold  to  1292.  The  results  are 
summarized  in  Table  1 .  Because  of  edge  analysis  errors,  an 
insufficient  number  of  radii  were  analyzable  with  version  1.45 
to  allow  precision  to  be  estimated.  Precision  expressed  as 
grams  per  square  centimeter  is  comparable  at  all  three  sites, 
but  the  differences  in  bone  size  lead  to  precision  expressed  as 
%CV  to  be  comparable  at  the  femur  and  humerus,  but  to  be 
larger  at  the  radius. 

To  determine  accuracy,  we  regressed  the  average  bone  min¬ 
eral  content  (BMC)  for  the  paired  scans  obtained  by  each  tech¬ 
nique  on  ash  mass.  These  analyses  are  summarized  in  Table  2 
and  Fig.  2.  The  data  demonstrate  very  clearly  that  DXA  BMC 
is  most  accurate  at  the  femur  and  least  accurate  at  the  radius, 
with  coefficients  of  determination  greater  than  0.9  regardless 
of  technique  at  the  former  site  and  coefficients  of  determina¬ 
tion  ranging  from  0.32  to  0.52  at  the  latter  site.  The  regression 
was  highly  significant  in  every  case,  p  <  10~5  for  all  of  the 
regressions  except  that  of  radii  using  water  scanning  and  soft¬ 
ware  version  2.10,  for  which  p  =  4.6  x  10"4.  In  addition, 
although  the  coefficients  of  the  regression  equation  are  close  to 
the  ideal  of  unity  at  both  the  femur  and  humerus  regardless  of 
technique,  the  coefficient  for  the  radial  BMC  equation  is 
approx  0.65,  suggesting  that  each  additional  milligram  of  ash 
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Table  2 

Ash  Mass-BMC  Regressions 


Site 

Technique 

Coefficient 

Intercept 

R2 

Femur 

1.45,  air 

1.11 

-0.00013 

0.95 

1.45,  water 

0.998 

0.00130 

0.92 

2.10,  air 

1.08 

0.00151 

0.96 

2.10,  water 

1.06 

0.00099 

0.97 

Flumerus 

1.45,  air 

1.06 

-0.00054 

0.83 

1 .45,  water 

0.946 

0.00700 

0.80 

2.10,  air 

0.997 

0.00069 

0.89 

2.10,  water 

1.02 

0.00013 

0.91 

Radius 

1.45,  air 

0.67 

0.00058 

0.52 

1.45,  water 

0.64 

0.00046 

0.52 

2.10,  air 

0.64 

0.00086 

0.52 

2.10,  water 

0.59 

0.00116 

0.32 

content  results  in  only  0.65  additional  milligrams  of  BMC  as 
measured  by  the  PIXImus  instrument. 

Bland-Altman  analysis  allows  straightforward  recognition 
of  the  pattern  of  differences  between  ash  mass  and  BMC 
analyses.  This  graphical  method  plots  the  difference  between 
BMC  and  ash  mass  on  the  y-axis  and  the  average  of  BMC  and 
ash  mass  on  the  x-axis.  Significant  correlation  between  these 
demonstrates  a  systematic  relationship  between  mineral  con¬ 
tent  and  the  measurement  discrepancy  between  the  two  meth¬ 
ods.  Bland-Altman  plots  for  the  femora  obtained  by  each 
technique  are  shown  in  Fig.  3  and  data  for  all  three  sites  are 
summarized  in  Table  3.  Three  of  the  four  techniques  yielded 
significant  positive  correlations,  whereas  no  significant  corre¬ 
lations  were  found  at  either  the  humerus  or  radius  by  any  of 
the  techniques. 

Position  Artifacts  Have  a  Small  But  Significant 
Impact  on  Ex  Vivo  DXA  Data 

We  scanned  multiple  bones  at  once,  so  bones  were  not  posi¬ 
tioned  precisely  at  the  center  of  the  X-ray  cone  beam.  The 
analysis  software  records  the  coordinates  of  each  region  of 
interest,  allowing  us  to  seek  a  relationship  between  specimen 
position  and  the  DXA  data.  We  did  this  by  performing  step¬ 
wise  linear  regression  analyses  in  which  the  BMD  was  mod¬ 
eled  as  a  function  of  ash  mass,  bone  volume,  specimen  x 
coordinate,  and  specimen  y  coordinate,  using  data  from  the 
femora  and  humeri.  We  excluded  the  radii  from  this  analysis 


Fig.  2.  Regression  of  BMC  on  ash  mass  using  air  scanning 
and  software  version  1.45.  (A-C)  show  the  results  for  femora, 
humeri,  and  radii,  respectively.  Note  the  widening  of  the  con¬ 
fidence  intervals  as  bone  size  decreases.  See  Table  2  for  sta¬ 
tistics  related  to  this  figure. 
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Fig.  3.  Bland- Altman  plots  of  femora.  The  average  of  ash  mass  and  BMC  is  plotted  on  the  x-axis  and  the  difference  between  ash  mass  and 
BMC  on  the  y-axis.  The  dotted  lines  show  the  2-SD  confidence  interval  of  the  average  difference  between  BMC  and  ash  mass.  (A)  and  (B)  show 
analysis  with  software  version  1.45  and  (C)  and  (D)  show  analysis  with  software  version  2.10.  (A)  and  (C)  show  air  scanning  and  (B)  and  (D) 
show  water  scanning. 
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Table  3 


Summary  of  Bland-Altman  Analyses 


Site 

Technique 

Average 

(BMC -Ash  Mass) 

±  SD  (mg) 

R 

Femur 

1.45,  air 

2.9+  1.0 

0.50** 

1.45,  water 

1.2  ±1.1 

— 

2.10,  air 

3.6  ±0.9 

0.42** 

2.10,  water 

2.8  ±0.8 

0.40* 

Humerus 

1.45,  air 

0.3  ±  0.6 

— 

1 .45,  water 

-0.6  ±  0.6 

— 

2.10,  air 

0.7  ±  0.4 

— 

2.10,  water 

0.4  ±  0.4 

— 

Radius 

1.45,  air 

-0.5  ±  0.5 

— 

1 .45,  water 

-0.8  ±  0.5 

— 

2.10,  air 

0.4  ±  0.5 

— 

2.10,  water 

-0.2  ±  0.6 

— 

Note:  *  p  <  0.05;  **  p  <  0.01 


Table  4 

Stepwise  Regression  Model  Parameters 


Technique 

a 

b 

c 

d 

1.45,  air 

0.0279 

0.985 

0.920 

0.0000145 

0.027 

0.0000064 

0.003 

1 .45,  water 

0.0283 

0.973 

0.916 

0.0000105 

0.014 

NS 

2.10,  air 

0.0286 

0.980 

0.926 

0.0000137 

0.025 

0.0000065 

0.003 

2.10,  water 

0.0292 

0.960 

0.923 

0.0000091 

0.011 

NS 

Note:  Coefficients  are  given  on  the  top  line  of  each  cell;  AR2  is 
given  on  the  second  line,  p  <  10-5  for  the  coefficients  b  and  c.  The  d 
coefficient  is  only  significant  with  air  scanning,  with  p  =  6  x  10-3  for 
software  version  1.45  and  p  =  3x  10-3  for  software  version  2.10. 


because  the  accuracy  and  precision  of  DXA  at  this  radius  are 
much  poorer  than  at  the  femur  and  humerus.  Both  addition  and 
subtraction  of  variables  included  raw  x  and  y  coordinates  as 
significant  variables  when  scanning  was  performed  in  air,  with 
slight  differences  in  the  regression  equation  depending  on  soft¬ 
ware  version.  Water  scans  demonstrated  a  significant  effect  of 
the  x  coordinate,  but  not  the  y  coordinate,  with  the  same  mod¬ 
els  generated  by  addition  and  subtraction.  The  regression 
model  parameters  are  summarized  in  Table  4  and  the  model 
equations  are  all  of  the  form 

BMD  =  a  +  b  (ash  mass)  +  c  {x  coordinate)  + 
d  (y  coordinate) 


with  the  coordinates  given  in  pixels.  The  A R2  ranged  between 
0.01 1  and  0.027  for  the  x  coordinate  and  between  0  and  0.003 
for  the  y  coordinate.  Regression  models  for  air  scanning  had 
slightly  greater  explanatory  power  than  did  those  for  water 
scanning,  but  the  magnitude  of  the  position  artifact  was 
slightly  greater  in  air  than  in  water. 

Discussion 

In  rodent  experiments,  DXA  scanning  serves  a  variety  of 
related  functions.  One  such  is  to  use  BMD  or  BMC  as  a  sur¬ 
rogate  measurement  for  more  technically  demanding  methods 
such  as  gravimetry.  Another  is  to  evaluate  an  explanatory  fac¬ 
tor  when  biomechanical  testing  is  performed.  A  third  is  to 
assess  longitudinal  changes  in  a  single  animal.  Each  of  these 
applications  requires  understanding  of  both  the  performance 
characteristics  of  DXA  scanning  and  the  underlying  biology  of 
the  effects  being  sought. 

We  therefore  performed  DXA  examinations  of  ex  vivo  long 
bones  in  air  or  water.  We  then  used  two  software  releases  to 
analyze  the  scans.  Our  analyses  led  to  the  following  four  con¬ 
clusions.  First,  software  version  2.10  is  superior  to  version  1.45 
insofar  as  it  allows  the  user  to  adjust  the  bone/soft  tissue  thresh¬ 
old.  Second,  there  are  only  minimal  differences  in  precision 
and  accuracy  between  scans  performed  in  air  and  scans  per¬ 
formed  in  water.  Third,  there  is  an  inverse  relationship  between 
bone  size  and  DXA  performance.  Fourth,  our  instrument  has  a 
small  but  significant  position  artifact,  with  the  BMD  varying 
systematically  according  to  a  specimen’s  placement  in  the  tar¬ 
get  area.  We  now  comment  on  each  of  these  observations. 

The  ability  to  adjust  the  threshold  allowed  us  to  interpret 
regions  of  interest  in  version  2.10  that  had  been  misanalyzed 
in  version  1.45.  Practically,  this  meant  that  we  were  able  to 
achieve  a  sufficient  sample  size  to  calculate  precision  of  radial 
scans  with  the  newer,  but  not  the  older,  software.  This  adjust¬ 
ment  entails  a  cost,  however,  as  the  background  noise  of  the 
scan  is  increased  with  a  downward  adjustment  of  the  thresh¬ 
old.  In  our  experiment,  the  increased  background  intensity 
made  two  scans  globally  uninterpretable. 

Although  precision  and  accuracy  differed  slightly  according 
to  technique  and  software  version,  these  variations  had  minimal 
impact.  Our  precision  and  accuracy  results  are  comparable  to 
those  obtained  by  Iida-Klein  and  co-workers  (13)  and  those 
obtained  by  Libouban  and  colleagues  (12).  These  differences  can 
be  attributed  in  part  to  differences  in  the  details  of  the  three  stud¬ 
ies.  At  the  femur,  the  only  common  site  studied  by  us  and  Iida- 
Klein  et  al.,  these  authors  reported  an  in  vivo  BMD  %CV  of 
3.09%  with  a  16-h  interval  between  scans,  although  the  %CV 
was  only  1 .98%.  Iida- Klein’s  study  entailed  more  specimen  han¬ 
dling  than  ours,  whereas  Libouban’s  studied  rat  bones  rather  than 
mouse  bones.  Our  results  support  the  practice  of  performing  ex 
vivo  scans  in  air  rather  than  submerged  in  water  or  a  buffer.  We 
advocate  air  scanning  because  controlling  buffer  depth  and  com¬ 
position  requires  additional  manipulation  of  specimens  and 
introduces  more  chances  for  operator  error.  Moreover,  the  com¬ 
parability  of  our  results  with  those  of  Iida-Klein  suggests  that 
there  is  little  to  be  gained  by  fixing  specimens  prior  to  study. 
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With  regard  to  scanning  submerged  specimens,  it  is  worth  noting 
that  Kastl  and  colleagues  ( 10)  observed  performance  differences 
based  on  buffer  salinity. 

We  observed  an  inverse  relationship  between  the  size  of  a 
scanned  bone  and  the  precision  and  accuracy  of  DXA  mea¬ 
surements.  This  is  unsurprising,  as  the  same  relationship  has 
been  established  in  numerous  human  and  animal  studies.  Pixel 
size  (0.18  mm2;  www.faxitron.com/PIXImus_specs.htm)  is 
large  relative  to  mouse  long  bone  area  (ranging  between  a 
maximum  of  0.54  cm2  for  femora  to  a  minimum  of  0.06  cm2 
for  radii;  average  data  given  in  Table  2).  It  is  also  important  to 
note  that  the  performance  of  gravimetric  determination  of 
BMD  and  BMC  is  poorer  in  smaller  bones.  Although  ashing  is 
the  historical  “gold  standard”  for  measuring  BMC,  it  is  impor¬ 
tant  to  recall  that  it  is  an  imperfect  method  for  very  small 
bones,  such  as  mouse  radii.  This  limitation  leads  us  to  recom¬ 
mend  that  other  investigators  performing  site-specific  studies 
use  the  largest  regions  of  interest  compatible  with  their 
research  questions. 

Finding  a  position  dependence  of  the  BMD  was  an  unex¬ 
pected  result  of  this  study.  To  our  knowledge,  our’s  is  the  first 
study  to  address  this  issue  with  the  PIXImus  densitometer. 
The  statistical  analysis  required  to  detect  and  introduce 
appropriate  adjustment  for  this  artifact  was  straightforward, 
relying  on  stepwise  linear  regression.  By  including  specimen 
x  and  y  coordinates  as  covariates  in  this  precision  study,  we 
were  able  to  determine  coefficients  for  positional  adjustment 
for  our  instrument.  The  position  artifact  was  calculated  using 
pooled  data  for  all  three  bones,  and  our  experiments  did  not 
provide  evidence  that  the  artifact  is  dependent  on  bone  size. 
As  for  densitometry  in  clinical  practice,  investigators  should 
perform  precision  studies  of  their  instruments  at  regular 
intervals.  We  recommend  that  they  also  seek  position  arti¬ 
facts,  as  we  have  done  here.  We  expect  that  position  artifacts 
will  be  instrument  dependent,  so  although  our  approach  is 
broadly  applicable,  the  regression  coefficients  are  specific  to 
our  instrument. 

An  important  limitation  of  our  study  is  that  we  did  not  com¬ 
pare  the  same  bones  in  vivo  and  ex  vivo.  Although  that  com¬ 
parison  would  be  of  interest  to  investigators,  logistical 
limitations  made  it  impossible  for  us  to  include  it  in  our  study. 
Iida- Klein  and  colleagues  (13)  included  this  comparison  in 
their  investigation  and  found  a  %CV  of  9.47%  between  in  vivo 
and  ex  vivo  scans  of  the  femur. 

Our  results  provide  guidance  regarding  application  of  DXA 
to  ex  vivo  small-animal  specimens,  and  it  is  important  to  note 
that  the  manufacturer  only  recommends  use  measurement  of 
whole-body  BMD  in  intact  animals.  Nevertheless,  there  are 
numerous  studies  reporting  analysis  of  isolated  specimens,  so 
the  observations  reported  here  are  relevant  to  current  work  in 
many  laboratories. 
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It  has  now  been  10  years  since  Morrison  et 
al.  (1)  reported  an  association  between 
vitamin  D  receptor  (VDR)  genotype  and 
bone  mineral  density  (BMD),  thus  spurring 
the  hope  that  it  would  be  possible  to  identify 
specific  alleles  of  a  small  number  of  “bone 
genes”  that  would  account  for  observed 
population  variability  of  fragility  fracture  risk. 
Although  many  scientists,  myself  included, 
doubt  that  the  reported  VDR  alleles  play  an 
important  role  in  bone  biology,  the  authors 
nevertheless  succeeded  in  raising 
awareness  in  the  bone  community  about 
genetic  methods  and  the  contribution  of 
hereditary  factors  in  determining  clinically 
important  bone  properties.  Because  a  great 
deal  of  progress  has  been  made  in  the 
intervening  decade,  providing  an 
encyclopaedic  review  of  work  in  the  field  is 
not  possible.  This  essay  reviews  some  of 
the  most  important  findings  in  a  thematic 
fashion,  with  apologies  to  investigators 
whose  research  is  not  cited. 

Identification  of  Molecular 
Signaling  Pathways  Whose  Roles 
in  Skeletal  Health  Were  Previously 
Unappreciated 

Rare  diseases,  the  genetics  of  which  follow 
a  simple  Mendelian  pattern  of  inheritance, 
offer  a  powerful  tool  for  understanding  some 
of  the  molecular  mechanisms  that  operate  in 
establishing  the  size  and  shape  of  bones 
and  the  properties  of  the  matrix  within 
bones.  Sometimes,  identifying  the  mutated 
gene  offers  limited  new  insight,  as  in  the 


case  of  osteogenesis  imperfecta.  It 
surprises  no  one  that  mutations  affecting  the 
major  structural  protein  in  bone  matrix  result 
in  skeletal  fragility  of  varying  severity.  This 
is  not  the  case  for  several  other  Mendelian 
disorders,  in  which  identifying  the 
responsible  genes  has  revealed  previously 
unsuspected  biology. 

The  importance  of  the  WNT  signaling 
pathway  in  establishing  bone  mass  is 
perhaps  the  best  known  example  of  an 
unexpected  biological  mechanism 
underlying  a  bone  disease,  but  it  is  not  the 
only  one.  Loss  of  function  mutations  of  the 
low-density  lipoprotein  receptor-related 
protein  5  gene  ( LRP5)  cause  osteoporosis- 
pseudoglioma  syndrome  (2-4),  whereas  gain 
of  function  mutations  in  the  gene  result  in 
the  high  bone  mass  phenotype  (5-7).  The 
WNT  pathway  was  well  known  prior  to  the 
discovery  that  it  contributes  to  bone  mass 
regulation,  dating  back  to  early  work  on  the 
Drosophila  mutation  wingless.  Investigation 
of  the  precise  roles  of  LRP5  and  the  other 
genes  involved  in  the  WNT  pathway  in  bone 
is  an  active  research  area  that  is  informed 
by  an  understanding  of  WNT  biology  in  a 
variety  of  other  settings. 

Positional  cloning  has  also  been  central  in 
establishing  that  a  phosphate  regulatory 
mechanism  operates  in  conjunction  with 
parathyroid  hormone  and  vitamin  D.  Once 
again,  uncommon  hereditary  diseases  with 
simple  inheritance  patterns  -  X-linked 
hypophosphatemic  rickets  (XLH)  (8,9)  and 
autosomal  dominant  hypophosphatemic 
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rickets  (ADHR)  --  were  the  starting  points 
(10).  The  determination  that  fibroblast 
growth  factor  23  (FGF-23)  is  phosphaturic,  a 
phosphate-regulating  endopeptidase  on  the 
X  chromosome  (PHEX)  substrate,  and 
resistant  to  proteolytic  degradation  when  an 
ADHR  mutation  is  present  establishes  a 
mechanistic  link  between  the  two  diseases 
(11,12).  Subsequently,  high  circulating 
levels  of  FGF-23  have  also  been  found  in 
patients  with  tumor-induced  osteomalacia 
(13,14),  extending  the  scope  of  the 
pathway’s  actions  to  acquired  disorders. 

The  PHEX/FGF-23  story  is  more  complex, 
however.  Both  matrix  extracellular 
phosphoglycoprotein  (15,16)  and  frizzled- 
related  protein  4  (17,18)  have  been 
proposed  as  phosphaturic  factors. 
Moreover,  some  investigators  have  failed  to 
find  elevations  of  FGF-23  in  XLH  or  PHEX- 
mediated  degradation  of  FGF-23  in  vitro, 
whereas  others  have  proposed  that  PHEX 
may  affect  phosphate  regulation  and  bone 
matrix  mineralization  through  mechanisms 
other  than  proteolysis  of  putative 
phosphaturic  factors  (19,20).  Further 
complexity  arises  from  FGF-23  and  PHEX  in 
the  kidney;  relationships  to  both  renal 
phosphate  transport  and  vitamin  D  1- 
hydroxylation  have  been  demonstrated,  but 
not  yet  fully  characterized  (e.g.,  21-23). 
These  conflicting  observations  support  the 
existence  of  multiple  important  phosphaturic 
factors,  and  ongoing  research  seeks  to 
better  define  the  roles  of  each  in  mediating 
phosphaturia  and  bone  matrix 
mineralization.  The  key  point  in  the  present 
context  --  that  investigation  of  rare  genetic 
diseases  has  spurred  discovery  of  novel  and 
unexpected  biology  --  is  strengthened  by 
current  efforts  to  reconcile  seemingly 
inconsistent  data. 

In  both  examples  cited  above,  fruitful  new 
hypotheses  have  emerged  because  the 
identities  of  disease-causing  genes  were 
surprising.  None  of  these  genes  would  have 
been  considered  attractive  candidates  in  the 
absence  of  compelling  linkage  data,  and  our 
efforts  to  assimilate  the  implications  of  the 
linkage  data  have  clearly  advanced  our 
understanding  of  bone  mass  and  phosphate 
homeostasis. 


Insightful  Use  of  Animal  Models 

Animal  models  allow  investigators  to 
measure  more  relevant  and  more  precisely 
defined  phenotypes  than  is  possible  in 
clinical  studies.  Indeed,  in  the  examples 
above,  the  focus  of  ongoing  research  has 
shifted  from  informative  families  to  either 
naturally  occurring  mutant  mice  or 
genetically  engineered  mice  (24-27). 

Mouse  and  rat  models  are  now  so  well 
integrated  into  bone  research  that  there  is 
no  need  to  belabor  the  methods  by  which 
they  are  generated  or  the  range  of  studies  to 
which  they  have  been  applied.  However,  it 
is  worth  noting  that  several  general-purpose 
model  animals  have  been  developed,  such 
as  those  expressing  reporter  genes  or  Cre 
recombinase  in  a  tissue-restricted  fashion 
(e.g.,  28,29). 

Animal  models  remain  a  particularly 
powerful  resource  for  identifying  genes  that 
affect  bone  properties,  particularly  in  the 
context  of  identifying  quantitative  trait  loci 
(QTLs).  The  concept  underlying  QTL 
genetics  is  that  alternative  alleles  of  multiple 
genes  each  contribute  incrementally  to  a 
trait  of  interest.  Furthermore,  segregation  of 
the  various  QTL  alleles  is  a  major  source  of 
variation  in  the  trait  among  members  of  a 
population.  Although  human  families  are 
enormously  useful  in  identifying  genes  and 
pathways  involved  in  diseases  with 
transparent  inheritance  patterns,  human 
samples  are  poorly  suited  for  determining 
the  basis  of  differences  in  bone  properties 
among  members  of  a  population. 
Population-level  differences  in  bone 
properties  are  much  more  subtle  than  those 
encountered  in  disease  states  and  are 
modified  by  environmental  factors  and  age. 
Moreover,  human  alleles  that  are  identical 
by  state  ( i.e according  to  the  detection 
method,  such  as  microsatellite  size  or 
restriction  fragment  length  polymorphism) 
might  not  be  identical  by  descent  (i.e., 
ancestral  origins  may  differ  and  therefore 
they  may  be  truly  different  in  a  genetic 
sense).  Experimental  crosses  in  animals 
obviate  some  of  the  problems  attending 
investigation  of  population  variation. 
Invasive  phenotypes  that  cannot  be  studied 
in  humans  can  be  investigated  in  model 
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organisms.  Husbandry  can  be  controlled 
and  animals  studied  at  the  same  age,  thus 
limiting  environmental  sources  of  variability. 
Because  inbred  strains  are  available  for 
study,  one  can  limit  the  number  of  alleles 
segregating  at  each  locus.  Use  of  inbred 
strains  also  eliminates  the  problem  of 
distinguishing  identity  by  descent  from 
identity  by  state.  In  experimental  crosses,  it 
is  straightforward  to  determine  both  linkage 
(i.e.  location  of  a  gene)  and  association  (i.e. 
which  allele  favors  a  high  value  of  the  trait 
being  studied).  These  advantages  are 
considerable,  but  entail  some  costs.  There 
is  no  assurance  that  bone  loci  found  in  an 
experimental  cross  will  prove  important  in 
humans.  Moreover,  a  cross  can  only 
identify  a  locus  if  the  parental  strains  harbor 
different  alleles  and  the  effect  size  is 
sufficiently  large  to  allow  detection.  Finally, 
there  is  considerable  work  between  mapping 
a  locus  and  identifying  the  responsible  gene. 
Limitations  notwithstanding,  experimental 
crosses  have  moved  from  being  a  promising 
strategy  to  a  proven  method  for  identifying 
genes  that  control  bone  properties. 

In  at  least  one  case  thus  far,  a  QTL  mapping 
study  in  mice  has  led  to  the  identification  of 
a  gene  -  Alox15  -  and  not  just  a 
chromosomal  region  (30).  This  study, 
conducted  by  Klein  and  colleagues,  is  the 
culmination  of  research  started  in  the  late 
1990s  (31,32).  The  recently  reported 
findings  include  the  validation  of  a 
chromosome  11  QTL  in  congenic  strains; 
recognition  of  Alox15  as  a  positional 
candidate  gene,  based  on  the  strength  of  a 
microarray  gene  expression  analysis, 
demonstrating  an  approximate  35-fold 
difference  in  Alox15  expression  between 
C57BL/6  (low)  and  DBA/2  (high);  and 
experimental  demonstration  of  the  impact  of 
Alox15  on  femoral  BMD  and  biomechanical 
performance  in  three  distinct  and 
independent  experimental  tests.  Based  on 
the  known  ability  of  Alox15  to  metabolize 
arachidonic  and  linoleic  acids  to  peroxisome 
proliferator-activated  receptor  gamma 
agonists,  the  investigators  hypothesized  that 
high  Alox15  activity  favors  differentiation  of 
mesenchymal  stem  cells  along  the 
adipocyte  lineage,  in  preference  to  the 
osteoblast  lineage.  As  in  the  cases  of 
LRP5,  FGF-23,  and  PHEX,  this  work  reveals 


a  metabolic  pathway  that  was  previously  not 
suspected  of  having  important  effects  on 
bone.  Adding  further  to  the  interest  of  this 
story  is  the  prior  detection  of  a  human  BMD 
QTL  in  the  region  harboring  ALOX15  (33). 
The  path  from  validated  QTL  (i.e.,  the 
preservation  of  the  effect  in  a  congenic 
strain)  to  identification  of  the  responsible 
gene  is  not  always  as  rapid  and 
straightforward  as  in  the  case  of  Alox15. 
Sometimes,  on  further  breeding,  a  linkage 
peak  proves  not  to  be  a  single  locus,  but  a 
group  of  linked  genes.  The  bone  group  in 
the  Jackson  Laboratory  has  demonstrated 
precisely  this  phenomenon  on  mouse 
chromosome  1  (34),  and  the  same  may  be 
true  of  other  QTLs  segregating  in  the 
C57BL/6J  x  C3H/HeJ  cross  (35). 
Furthermore,  the  number  of  positional 
candidate  genes  -  even  if  isolated  within  a 
congenic  segment  -  remains  large,  and 
testing  these  systematically  is  an  arduous 
task. 

Of  interest,  human  allele  association  studies 
can  be  a  powerful  tool  in  the  endgame  of 
identifying  which  gene  in  a  candidate 
interval  is  responsible  for  the  phenotypic 
effect.  The  ability  to  undertake  genetic 
studies  involving  multiple  species  is  made 
possible  by  the  existence  of  comparative 
genetic  maps  (e.g.,  the  mouse-human 
comparative  map  is  available  at 
http://www.ncbi.nlm.nih.qov/Homoloav/Davis 
/).  This  hybrid  strategy  is  being  pursued  to 
find  an  X-linked  gene  identified  as 
contributing  to  postmaturity  BMD  change  in 
mice  and  lumbar  BMD  in  postmenopausal 
women,  narrowing  the  search  to  one  of  two 
genes  (36).  The  short  distances  over  which 
linkage  disequilibrium  persists  and  the 
complex  pattern  of  population  mixing  among 
humans  facilitate  dissection  of  a 
chromosome  region  that  is  too  small  for 
crossovers  to  be  helpful  in  a  mouse  model. 
Thus,  some  of  the  very  features  that  make 
human  populations  unattractive  for  genome¬ 
wide  scans  make  them  ideal  for  fine-scale 
mapping  of  an  established  positional 
candidate. 

Although  identifying  the  responsible  genes 
remains  the  primary  goal  of  experimental 
crosses,  congenic  strains  constructed  to 
confirm  linkage  are  valuable  in  a  number  of 
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ways  that  are  independent  of  their  utility  in 
identifying  the  gene(s)  underlying  a  QTL. 
Unlike  the  unique  animals  generated  in  a 
cross,  congenics  are  inbred,  allowing  the 
study  of  multiple  genetically  matched 
individuals.  In  a  cross,  investigators  often 
choose  to  sacrifice  the  biological  depth  of 
the  traits  being  measured  to  gain  either 
precision,  ease  of  measurement,  or  both  - 
deferring  more  detailed  and  biologically 
informative  phenotypic  studies  until 
congenics  are  available.  Thus,  for  example, 
although  a  QTL  on  chromosome  4  was 
mapped  because  of  its  effect  on  femoral 
volumetric  BMD,  experiments  with  the 
resulting  congenic  strain  have  demonstrated 
that  there  is  a  mechanical  responsiveness 
gene  within  the  donor  segment  (37). 
Congenic  strains  are  also  powerful 
resources  in  studies  addressing  interactions 
among  QTLs  (38),  between  QTLs  and 
environment  (39),  and  between  QTLs  and 
sex  (40,41).  Such  experiments  exploit  the 
advantages  of  animal  models,  and  the  study 
of  interactions  is  an  area  in  which  research 
with  experimental  animals  will  continue  to 
provide  more  insight  than  can  be  gained 
from  clinical  studies. 

Human  Genetic  Methodology 

Genome-wide  screens  for  bone  QTLs  in 
humans,  by  either  linkage  or  association 
methods,  are  potentially  problematic,  but 
recent  methodological  advances  promise  to 
help  overcome  two  of  the  limitations  inherent 
in  human  quantitative  trait  genetics. 
Genome-wide  scans  in  humans  for 
quantitative  bone  traits  are  generally 
underpowered,  regardless  of  whether 
linkage  or  association  methods  are  used. 
Underpowered  genetic  studies  lead  to  two 
related  errors:  failure  to  identify  QTLs  and 
overestimation  of  the  effect  sizes  of 
identified  QTLs  (42).  Cross-sectional  allele 
association  studies  are  subject  to  false¬ 
positive  results  that  arise  from  population 
stratification.  Research  addressing  each  of 
these  problems  is  briefly  noted  here. 

The  Framingham  investigators  have  applied 
principal  components  analysis  to  extract 
synthetic  uncorrelated  principal  component 
phenotypes  from  a  larger  number  of 
interdependent  bone-related  raw  data 


measurements,  including  multiple  dual¬ 
energy  x-ray  absorptiometry  and  quantitative 
heel  ultrasound  parameters  (43).  The 
approach  is  conceptually  simple  and  can  be 
performed  with  standard  statistical  analysis 
software.  The  multiple  raw  measurements 
are  first  combined  into  a  composite  measure 
and  then  broken  down  into  a  minimal  set  of 
orthogonal  vectors  (/'.e.,  the  principal 
components).  All  lumbar  spine,  femoral, 
and  calcaneal  data  from  the  Framingham 
study  can  be  represented  as  two  principal 
components  that  accounted  for  66%  and 
24%  of  the  composite  phenotype  variability. 
In  the  case  of  hip  data  alone,  a  single 
principal  component  was  extracted  that 
accounted  for  90%  of  the  variability. 
Mapping  the  principal  components  identified 
several  regions  suggestive  of  linkage,  but 
none  of  the  regions  achieved  genome-wide 
statistical  significance.  It  is  important  to 
note,  however,  that  the  principal  component 
heritabilities  exceeded  those  of  the 
individual  underlying  measurements,  thus 
demonstrating  that  they  are  more  robust  for 
mapping  studies.  Therefore,  despite  the 
failure  to  achieve  significant  linkage,  this 
paper  is  important  because  of  the 
methodological  innovation. 

Using  data  from  the  Indiana  Sisters  Study, 
the  bone  genetics  group  at  Indiana 
University  has  measured  the  effects  of 
population  stratification  on  the  rate  of  false¬ 
positive  associations  (44).  If  a  population  is 
stratified,  association  can  arise  artifactually  - 
-  as  a  consequence  of  coincidental 
differences  in  allele  frequencies  between 
two  population  groups  that  also  differ  with 
regard  to  the  trait  being  studied,  rather  than 
as  a  consequence  of  a  true  biological 
connection  between  the  phenotype  being 
studied.  A  trivial  example  illustrates  the 
point:  although  the  HBBS  allele  is 

associated  with  skin  pigmentation,  all 
recognize  that  this  association  reflects  the 
higher  prevalence  of  sickle  cell  hemoglobin 
in  individuals  of  African  descent,  rather  than 
an  inherent  causal  relationship  between 
hemoglobin  genotype  and  skin  color.  The 
problem  of  population  stratification  is 
common  ( i.e .,  most  populations  are 
stratified),  and  the  degree  of  stratification  is 
often  difficult  or  impossible  to  quantify.  The 
investigators  found  that  increasing 
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admixture  increases  the  rate  of  false- 

positives  in  association  studies  and  that  one 

can  nevertheless  address  admixture  by 
including  unlinked  "negative  control" 

polymorphisms  to  estimate  the  magnitude  of 
the  admixture  effect.  Their  proposed 
inclusion  of  unlinked  loci  to  control  for 
admixture  provides  a  simple,  practical 

approach  that  can  be  used  as  an  alternative 
to  the  more  difficult  alternative  of  performing 
association  studies  using  the  two-generation 
transmission  disequilibrium  test. 

A  Philosophical  Reflection  on 
Complex  Trait  Genetics 

Ultimately,  as  a  community,  we  face  the  task 
of  achieving  a  philosophical  advance  to 
match  the  scientific  advances  of  recent 
years.  As  scientists,  we  are  deeply 
committed  to  seeking  and  understanding  the 
causes  underlying  natural  phenomena.  Yet, 
our  notions  of  causality  are  both  inadequate 
and  imprecise.  We  generally  agree  about 
acknowledging  the  “causality”  of  factors  that 
are  either  necessary  or  sufficient  for  a 
specified  outcome.  Neither  of  these  criteria 
of  causality  seems  to  be  adequate  for 
helping  us  understand  complex  trait 
genetics,  because  genes  that  contribute  to 
bone  properties  may  be  neither  necessary 
nor  sufficient  to  determine  biologically 
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ABSTRACT:  We  studied  reproducibility  of  the  ISCD  vertebral  exclusion  criteria  among  four  interpreters. 
Surprisingly,  agreement  among  interpreters  was  only  moderate,  because  of  differences  in  threshold  for 
diagnosing  focal  structural  defects  and  choice  of  which  vertebra  among  a  pair  discordant  for  T-score,  area, 
or  BMC  to  exclude.  Our  results  suggest  that  reproducibility  may  be  improved  by  specifically  addressing  the 
sources  of  interobserver  disagreement 


Introduction:  Although  DXA  is  widely  used  to  measure  vertebral  BMD,  its  interpretation  is  subject  to 
multiple  confounders  including  osteoarthritis,  aortic  calcification,  and  scoliosis.  In  an  attempt  to  standardize 
interpretation  and  minimize  the  impact  of  artifacts,  the  International  Society  for  Clinical  Densitometry 
(ISCD)  established  criteria  for  vertebral  exclusion,  including  the  presence  of  a  focal  structural  defect  (FSD), 
discrepancy  of  >1-SD  in  T-score  between  adjacent  vertebrae,  and  a  lack  of  increase  in  BMC  or  area  from  1^ 
to  L4.  Whereas  the  efforts  of  the  ISCD  represent  an  important  advance  in  BMD  interpretation,  the  interob- 
server  reproducibility  with  application  of  these  criteria  is  unknown.  We  hypothesized  that  there  would  be 
substantial  agreement  among  four  interpreters  regarding  application  of  the  exclusion  criteria  and  the  final 
lumbar  spine  T-score. 

Materials  and  Methods:  Each  interpreter  read  a  set  of  200  lumbar  DXA  scans  obtained  on  male  veterans, 
applying  the  ISCD  vertebral  body  exclusion  criteria. 

Results:  Surprisingly,  agreement  among  interpreters  was  only  moderate.  Differences  in  interpretation  resulted 
from  differing  thresholds  for  recognition  of  FSD  and  the  choice  of  excluding  the  upper  or  lower  vertebral  body 
for  the  criteria  requiring  'comparison  between  adjacent  vertebrae. 

Conclusions:  Despite  their  apparent  simplicity,  the  ISCD  vertebral  exclusion  criteria  are  difficult  to  apply 
consistently.  In  principle,  appropriate  refinement  of  the  exclusion  criteria  may  significantly  improve  interob- 
* .sbfVfcjr' Ug^ertient.- 

3  Bone’  Miner  Res  2005;20:501-508.  Published  online  on  November  29,  2004;  doi:  10.1359/JBMR.041134 
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INTRODUCTION 

BMD  is  the  single  best  test  to  predict  fragility  frac¬ 
ture/1"45  Although  various  biochemical  markers  of 
bone  turnover,  ultrasonic  properties  of  bone,  and  quantita¬ 
tive  CT  scanning  each  predict  fragility  fracture,<5'10)  BMD 
measured  using  DXA  remains  the  gold  standard  for  osteo¬ 
porosis  diagnosis  and  fracture  risk  assessment  in  clinical 
practice.  However,  DXA  is  an  imperfect  tool.  For  example, 
interpretation  of  lumbar  spine  bone  mass  is  confounded  by 
artifacts  including  degenerative  arthritis,  aortic  calcifica¬ 
tion,  and  scoliosis/115  To  address  this  problem,  the  Inter- 
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artis,  NPS  Pharmaceuticals,  and  Roche.  All  other  authors  have  no 
conflict  of  interest. 


national  Society  for  Clinical  Densitometry  (ISCD)  cur¬ 
rently  recommends  exclusion  of  individual  vertebrae  when 
interpreting  lumbar  spine  BMD  if  any  of  four  criteria  are 
■present/125  Specifically,  a  vertebral  body  is  excluded  if 
there  is  a  focal  structural  defect  (FSD),  unusual  discrepancy 
in  T-score  between  adjacent  vertebrae,  or  a  lack  of  increase 

in  BMC  or  bone  area  when  proceeding  caudally  from  Lx  to 

T  (12) 

j-,4. 

While  the  efforts  of  the  ISCD  represent  an  important 
advance  in  the  approach  to  BMD  interpretation,  interob¬ 
server  reproducibility  in  the  application  of  ISCD  criteria  for 
vertebral  body  exclusion  is  unknown.  We  hypothesized  that 
interpreters  could  achieve  good  consensus  in  applying  the 
ISCD  exclusion  criteria  and  determining  the  resultant  lum¬ 
bar  spine  T-score.  To  test  this  hypothesis,  we  analyzed 
agreement  among  observers  in  a  set  of  200  lumbar  spine 
DXA  scans. 
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FIG.  1.  Overall  and  FSD-related  vertebral 
exclusion  rates  by  observer  and  vertebra.  For 
vertebral  exclusion,  two-way  ANOVA  re¬ 
veals  significant  differences  according  to 
both  interpreter  and  vertebra,  without  inter¬ 
action  between  these  factors.  Posthoc  analy¬ 
sis  shows  significant  pairwise  differences  for 
all  comparisons  except  L,  vs.  Lj  and  observ¬ 
ers  2  vs.  3.  For  FSD,  two-way  ANOVA  re¬ 
veals  significant  differences  according  to 
both  interpreter  and  vertebra,  without  inter¬ 
action  between  these  factors.  Posthoc  analy¬ 
sis  shows  significant  pairwise  differences  for 
all  comparisons  except  L]  vs.  Lj,  L,  vs.  L3, 
and  observers  1  vs.  2  and  observers  3  vs.  4. 
Analysis  by  McNemar’s  test  yielded  similar 
results. 


MATERIALS  AND  METHODS 

Bone  densitometry 

De-identified  lumbar  spine  DXA  reports  of  200  male 
veterans  among  533  tested  at  the  William  S.  Middleton  Vet¬ 
erans  Hospital  between  January  2  and  October  5, 2002  were 
randomly  chosen  for  evaluation.  A  single  .ISCD-trained 
'technologist  performed  all  examinations  using  a  GEMedi- 
.  cal-  Systems  Lunar  Expert-XL  densitometer  (Madison,  WI, 
JJSA);Lfeta  were  analyzed  using  software  version  1.92  and 
•the'  GE  Lunar  male  normative  database. 

Subsequently,  four  ISCD-certified  physicians  reviewed 
the  DXA  images  and  ancillary  results  to  estimate  reproduc¬ 
ibility  of  the  ISCD  criteria  for  vertebral  body  exclusion. 
The  indication(s)  for  exclusion  of  vertebral  bodies  were 
noted,  along  with  the  resulting  lumbar  spine  T-score,  for 
each  report.  The  GE  Lunar  software  for  these  BMD  re¬ 
ports  did  not  permit  calculation  of  T-score  among  noncon¬ 
tiguous  vertebrae.  Thus,  if  two  adjacent  and  one  noncon¬ 
tiguous  vertebral  body  remained,  interpreters  were 
instructed  to  choose  the  mean  T-score  of  the  two  adjacent 
vertebrae  as  the  final  lumbar  spine  T-score,  as  suggested  by 
the  ISCD  position  Statement/125  When  two  nonadjacent 
vertebral  bodies  remained  after  application  of  the  exclusion 
criteria,  interpreters  were  asked  to  choose  the  numerically 
lower  T-score  as  the  final  value. 

All  study  data  were  entered  in  duplicate  into  an  Excel 
spreadsheet  for  statistical  analysis.  The  University  of  Wis¬ 
consin  Human  Subjects  Committee  and  the  William  S. 
Middleton  Veterans’  Hospital’s  Research  and  Develop¬ 
ment  Committee  reviewed  the  study  protocol  and,  as  all 
patient  identifiers  had  been  removed  from  the  data, 
deemed  that  informed  consent  was  not  needed  for  this 
study. 

Statistical  analysis 

To  determine  sample  size  for  this  study,  interpreters  2 
and  3  reviewed  a  random  sample  of  54  male  BMD  reports, 


applying  the  ISCD  criteria  for  vertebral  body  exclusion. 
Subsequently,  high  agreement  in  lumbar  spine  T-score  was 
achieved  (r  =  0.98,  Pearson’s  correlation  coefficient),  with 
an  average  difference  in  BMD  T-score  of  0.08  ±  0.4  be¬ 
tween  the  two  interpreters.  Based  on  these  data,  using  a 
two-sided  a  error  of  0.05,  review  of  200  BMD  reports  would 
provide  a  power  of  94%  to  detect  a  difference  of  0.1  in 
T-score  between  paired  interpreters. 

We  used  McNemar’s  and  k  test  statistics  to  assess  differ¬ 
ences  in  rates  of,  and  indications  for,  vertebral  body  exclu¬ 
sion  among  interpreters.  ANOVA  and  paired  and  unpaired 
r-tests  were  used  to  compare  continuous  variables,  While  the 
X2  test  was  used  to  compare  categorical  variables.  We  per¬ 
formed  linear  regression  to  compare  interpreters’  T-scores 
for  each  scan,  obtaining  the  Pearson  correlation  coefficient, 
prediction  equation,  and  SE  of  the  estimate  for  each  inter¬ 
preter  pair.  Significance  levels  were  adjusted  according  to 
Bonferroni’s  correction  to  account  for  multiple  compari¬ 
sons.  Statistical  analyses  were  performed  using  Analyze-It 
software  (Leeds,  UK),  Sigma  Stat  2.03  (SPSS),  and  Excel 
(Microsoft).  Calculation  of  k  was  based  on  formulas  ob¬ 
tained  from  the  Medi'eaT Algorithms  Project/135 

RESULTS 

Clinical  characteristics  of  the  study  population 

Male  veterans  in  this  study  had  a  mean  age  of  64  ±  12 
years  (range,  25-89  years).  Most  were  white  (97%),  with 
two  Hispanics  and  four  blacks  among  the  group.  Tire  aver¬ 
age  body  mass  index  was  29.1  kg/m2  (range,  17.4-45.1  kg/ 
m2).  Current  and  prior  alcohol  use  was  reported  among  39 
(20%)  and  27  (14%)  men,  whereas  current  and  prior  to¬ 
bacco  use  was  noted  in  33  (17%)  and  66  (33%)  men,  re¬ 
spectively.  Among  all  men,  98  (49%)  reported  a  prior  frac¬ 
ture,  including  95  (48%)  with  low  trauma  fractures. 

Vertebral  body  exclusion 

Figure  1  shows  vertebral  exclusion  by  the  various  observ¬ 
ers,  and  two  trends  are  apparent.  First,  the  rate  of  vertebral 
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Table  1.  Agreement  Among  Interpreters  by  k  Statistic* 


Average  k  (range) 

L, 

Exclusion 

0.350  (0.256, 0.446) 

FSD 

0.359(0.263,0.434) 

T-score  discrepancy 

0.505  (0.416, 0.668) 

Bone  area 

0.138  (-0.039, 0.264) 

BMC 

0.337  (0,081, 0.615) 

L> 

Exclusion 

0.350  (0.215, 0.468) 

FSD 

0.383  (0.211, 0.558) 

T-score  discrepancy 

0.547  (0.291, 0.779) 

Bone  area 

0.312  (0.167, 0.642) 

BMC 

0.329  (0.167, 0.517) 

l3 

Exclusion 

0.430  (0.278, 0.496) 

FSD 

0.342  (0.145, 0.523) 

T-score  discrepancy 

0.670  (0.552, 0.809) 

Bone  area 

0.503  (0.377, 0.851) 

BMC 

0.148  (-0.005, 0.433) 

Exclusion 

0.500  (0.370, 0.567) 

FSD 

0.407  (0.229, 0.519) 

T-score  discrepancy 

0.687  (0.598, 0.798) 

Bone  area 

0.652  (0.468, 0.847) 

BMC 

0.326  (-0.003, 0.665) 

*Cut-points  for  kk^O,  poor  agreement;  0<k<  0.2,  slight  agreement; 
0.2  <  k  s  0.4,  fair  agreement;  0.4  <  k  £  0.6,  moderate  agreement;  0.6  <  k 
£  0.8,  substantial  agreement  0.8  <  k,  excellent  agreement. 


body  exclusion  rises  from  Lx  to  L4.  Second,  each  observer 
figs  a  tharagteristic  threshold  for  excluding  vertebrae.  For 
.*  eiainple^bserveF  1  excluded  the  fewest  vertebrae, 
whereaSvcfoserver  4  excluded  the  most  vertebrae,  at  any 
given  level.  Bpth  tr£nds  are  significant  by  two-way 
ANOVA,  with  p  <‘0.0001  for  vertebrae  and  p  =  0:0016  for 
observers,  without  a  significant  interaction  between  verte¬ 
brae  and  observers.  Post  hoc  analysis  by  Tukey’s  test  re¬ 
veals  significant  pairwise  differences  in  vertebral  body  ex¬ 
clusion  between  all  vertebrae  except  Li  and  L2  and 
between  all  interpreters  except  2  and  3. 

Differences  in  BMD  interpretation  among  the  interpret¬ 
ers  are  also  reflected  by  the  number  of  scans  deemed  un¬ 
readable.  Lumbar  spine  images  were  completely  excluded 
from  analysis  in  4  cases  by  interpreter  1,  2  cases  by  inter¬ 
preter  2,  16  cases  by  interpreter  3,  and  24  cases  by  inter¬ 
preter  4.  The  frequency  of  unreadable  scans  is  significantly 
different  among  observers  (x2  =  29.8,  df  =  3,  p  <  0.0001). 
Pairwise  comparisons  are  significant  for  observers  2  and  4 
(x2  =  13.9,  df  =  1,  p  =  0.0002),  observers  1  and  4  (x2  = 
18.1,  df  =  1  ,p  =  0.00002),  and  observers  1  and  3  (x2  =  9.8, 
df  =  1,  p  =  0.0017),  whereas  the  difference  is  of  marginal 
significance  for  observers  2  and  3  (x2  =  6.4,  df  -  1,  p  = 
0.012). 

We  used  the  k  statistic  to  gauge  agreement  among  four 
observers  regarding  vertebral  body  exclusion  and  regarding 
application  of  each  of  the  four  ISCD  exclusion  criteria 
(Table  1).  k  values  range  from  -0.039,  indicating  no  agree¬ 
ment,  to  0.851,  indicating  excellent  agreement.  However, 
the  majority  of  k  values  fall  between  0.2  and  0.6,  indicating 
fair  to  moderate  agreement  between  obseivers. 


FIG.  2.  A  DXA  image  showing  disagreement  regarding  exclu¬ 
sion  because  of  the  presence  of  vertebral  abnormalities.  Inter-, 
preter  3  excluded  L,  only,  interpreters  1  and  2  excluded  L,  and  Lj, 
and  interpreter  4  excluded  all  vertebral  bodies,  judging  this  scan 
unreadable. 


The  surprisingly  poor  agreement  among  observers 
prompted  us  to  seek  the  factors  leading  to  different  choices 
regarding  vertebral  body  exclusion.  Two  factors  account  for 
most  of  the  discrepancy  among  observers:  differing  thresh¬ 
olds  for  vertebral  exclusion  because  of  FSD  and  differing 
choice  of  which  vertebra  among  a  discordant  pair  to  ex¬ 
clude. 

FSDs 

The  four  observers  exhibited  differing  judgment  regard¬ 
ing  the  presence  of  FSD  (Fig.  1).  An  example  of  disparate 
exclusion  based  on  FSD  is  shown  in  Fig.  2.  In  this  case, 
interpreter  3  excluded  Lj  only,  interpreters  1  and  2  ex¬ 
cluded  L[  and  L2,  and  interpreter  4  excluded  all  vertebral 
bodies  because  of  FSD,  resulting  in  an  unreadable  scan.  We 
analyzed  FSD  by  two-way  ANOVA-,  finding  significant  dif¬ 
ferences  in  FSD  frequency  related  to  vertebra  (p  ~ 
0.00013)  and  observer  (p  =  0.00009)  without  interaction 
between  these  factors.  Posthoc  analysis  by  Tukey’s  test  re¬ 
veals  significant  pairwise  differences  for  all  vertebral  com¬ 
parisons  except  L,  versus  L2  and  Lj  versus  L,  and  for  all 
observer  pairs  except  1  versus  2  and  3  versus  4.  Overall, 
observer  4  had  the  lowest  threshold  for  calling  a  FSD,  fol¬ 
lowed  by  observers  3, 2,  and  1. 

In  general,  increasing  patient  age  was  associated  with  less 
consistent  agreement  regarding  the  presence  of  FSD.  For 
example,  between  interpreters  1  and  2,  the  mean  age  of 
men  for  whom  full  agreement  regarding  the  presence  of 
FSD  was  achieved  was  61.2  ±  12  years.  Conversely,  among 
men  for  whom  disagreement  regarding  FSD  was  noted,  the 
mean  age  was  68.8  ±  9.6  years  (p  <  0.0001  for  comparisons 
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Table  2  Agreement  Regarding  FSDs  and  Impact  on  Mean  T-Score 


Mean  T-score  ±  SD 
lower-numbered 
obseiver 

Mean  T-score  ±  SD 
higher-numbered 
observer 

T-score  p  value* 

Age  p  valuer 

Interpreters  1  and  2 

Agree,  n  —  111 

Age  61.2  ±  12.3  years 

-1.49  ±1.37 

-1.50  ±1.34 

0.78 

<0.0001 

Disagree,  n  =  89 

Age  68.8  ±  9.6  years 
Interpreters  1  and  3 

-1.42  ±  1.60 

-1.25  ±1.53 

0.0001 

Agree,  n  =  64 

Age  66.6  ±  10.9  years 

-1.54  ±1.42 

-1.58  ±  1.36 

0.0619 

0.0006 

Disagree,  n  =  136 

Age  60.1  ±  12.5  years 
Interpreters  1  and  4 

-1.38  ±  1.46 

-1.76  ±  1.41 

<0.0001 

Agree,  n  =  54 

Age  60.2  ±  12.3  years 

-1.65  ±  1.35 

-1.67  ±  1.37 

0.65 

0.0014 

Disagree,  n  =  146 

Age  66.2  ±  11.2  years 
Interpreters  2  and  3 

-1.26  ±1.48 

-1.75  ±  1.40 

<0.0001 

Agree,  n  =  89 

Age  63,-0  ±  1’2.3  years 

-1.53  ±  1.49 

-1.54  ±1.49 

0.8291 

0.0911 

Disagree,  n  =  111 

Age  65,8  ±11.3  years 
Interpreters  2  and  4 

-1.58  ±  1.39 

-1.79  ±  1.37 

0.0003 

Agree,  n  =  74 

Age  63.5  ±  12.7  years 

-1.68  ±  1.51 

-1.69  ±  1.54 

0.93 

0.360 

Disagree,  n  =  126 

Age  65.13  ±  11.2  years 
Interpreters  3  and  4 

-1.32  ±  1.44 

-1.75  ±  1.28 

0.0001 

Agree,  n  =  83 

Age  62.5  ±  12.2  years 

-1.70  ±1.52 

-1.75  ±  1.50 

0.0875 

0.0459 

Disagree,  n  =  117 

-1.93  ±  1.17 

-1.83  ±  1.20 

0.0929 

, ;  Age  ‘66:0  ±  11.4  years 

•  *l‘f  fired  r-test,  significant  values  are  shown  in  bold. 

.  *  Unpaired  r-test,  significant  values  are  shown  in  bold. 


of  age  by  independent  t-test).  Table  2  provides  additional 
comparisons  for  age  and  agreement  regarding  FSD. 

Exclusion  criteria  based  on  comparison  of 
adjacent  vertebrae 

A  T-score  discrepancy  of  >1  SD  unit  among  adjacent 
vertebrae  and  a  lack  of  increase  in  BMC  or  area  proceeding 
caudally  from  Lt  to  L4  are  each  cause  for  excluding  a  ver¬ 
tebral  body.  These  exclusion  criteria  differ  from  the  pres¬ 
ence  of  FSD  because  they  are  applied  to  vertebral  pairs, 
rather  than  to  single  vertebrae.  Moreover,  once  a  pair  has 
been  identified  as  warranting  exclusion  of  one  vertebral 
body,  the  interpreter  must  still  decide  which  vertebral  body 
to  exclude. 

The  k  data  in  Table  1  reveal  that  there  was  greater  agree¬ 
ment  among  observers  for  these  three  comparative  exclu¬ 
sion  criteria  than  for  FSD.  Moreover,  good  agreement  was 
achieved  between  interpreters  (corrected  McNemar’s  test 
not  significant  for  all  but  5  comparisons  among  72)  when 
applying  the  three  exclusion  criteria.  However,  despite  the 
apparent  objectivity  of  the  numeric  exclusion  criteria,  in¬ 
terpreters  differed  in  their  application  of  these  criteria.  In 
Table  3  we  show  one  such  example,  with  interpreters  1  and 


4  excluding  Lj  and  L4,  whereas  interpreters  2  and  3  ex¬ 
cluded  Lj,  L3,  and  L4  with  application  of  the  three  com¬ 
parative  exclusion  criteria. 

Therefore,  given  the  observer’s  choice  regarding  which  of 
two  paired  vertebrae  to  exclude,  we  investigated  whether 
scoring  agreement  by  tire  vertebral  pair,  rather  than  by  the 
single  vertebra,  would  reveal  greater  concordance  among 
observers  by  k.  Because  exclusion  of  Lj  or  L3  could  be  due 
to  comparison  with  either  tire  vertebra  above  or  below  it, 
we  limited  this  additional  analysis  to  the  I^-Lj  and  the 
L3— L4  pairs.  The  corrected  k  for  the  three  comparative  ex¬ 
clusion  criteria  are  shown  in  Table  4.  With  adjustment  for 
arbitrarily  choosing  the  upper  or  lower  of  two  vertebral 
bodies,  the  average  k  among  paired  interpreters  increased 
by  an  average  of  0.195  to  a  mean  k  of  0.629,  indicating 
substantial  agreement.  Moreover,  12  of  36  corrected  com¬ 
parisons  yielded  k  >  0.8,  indicating  excellent  agreement 
among  observers. 

Impact  of  vertebral  exclusion  on  T-score 

Given  that  observers  differ  in  their  application  of  the 
ISCD  vertebral  exclusion  criteria,  it  is  important  to  deter¬ 
mine  the  impact  of  interobserver  differences  on  final  T- 
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Table  3.  Example  of  Interpreter  Disagreement  Regarding  Vertebral  Exclusion  for  Comparative  Criteria 


Vertebra 

BMD 

(g/cm2) 

T-score 

Bone  area 
(cm2) 

BMC 

Interpreters'  decisions 

L, 

0.815 

-2.9 

15.7 

12.8 

Interpreters  2  and  3  exclude  for  T-score 
discrepancy  and  lack  of  increase  in  BMC 

4 

0.722 

-4.3 

17.2 

12.4 

Interpreters  1  and  4  exclude  for  T-score 
discrepancy  and  lack  of  increase  in  BMC 

4 

0.954 

-2.4 

18.2 

17.4 

Interpreters  2  and  3  exclude  for  T-score 
discrepancy 

4 

0.999 

-2.0 

14.9 

14.9 

All  interpreters  exclude  for  lack  of  increase 
bone  area  and  BMC 

* 


Table  4.  Improvement  in  k  Statistic  After  Correction  for 
Paired  Vertebral  Analysis 


Mean  k  statistic*  Mean  improvement 
for  paired  in 

vertebrae  (range)  (range) 


Lj/Lj  f 

T-score  discrepancy 
Bone  area 
BMC 

L3/L4 

T-score  discrepancy 
Bone  area 
BMC 


0.725  (0.559, 0.910) 
0.373  (0.149, 0.865) 
0.603  (0.465, 0.881) 

0.832  (0.756, 0.881) 
0.749(0.609,0.927) 
0.496  (0.124, 0.835) 


0.220  (0.059, 0.318) 
0.272  (0.106, 0.601) 
0.266  (0.136, 0.343) 

0.145  (0.087, 0.201) 
0.097  (0.036, 0.182) 
0.170  (0.090, 0.255) 


"Cut-points  for  k:  kSO,  poor  agreement;  0  <  k  £  0.2,  slight  agreement; 
0.2  <  k  £  0.4,  fair  agreement;  0.4  <;  k  £  0.6,  moderate  agreement;  0.6  <  k 
£.0.8,  substantial  agreement;  0.8  <  k,  excellent  agreement. 

,  "Improvement  relative  to  L,  for  Lj/l?  pair  and  relative  to  L4  for  L3/L4 
pair!  "  : 


scores.  Excluding  those- studies  deemed  not  assessable  by  at 
least  one  .interpreter  {n  =  36),  the  mean  lumbar  spine  T- 
score  was  -1.55  ±  1-.32  for  interpreter  1,  -1.70  ±  1.29  for 
interpreter'2,  -1.82  ±  1.29  for  interpreter  3,  and  -1.85  ±  1.29 
for  interpreter  4  (p  >  0.05  by  ANOVA  and  subsequent 
pairwise  comparisons).  For  all  200  BMD  reports,  the  mean 
lumbar  spine  T-score  was  -1.36  ±  1.46  for  interpreter  1, 
-1.45  ±  1.48  for  interpreter  2,  -1.68  ±  1.43  for  interpreter  3, 
and  -1.73  ±  1.40  for  interpreter  4  (p  >  0.05  by  ANOVA  and 
subsequent  pairwise  comparisons). 

Using  linear  regression  to  assess  the  relationship  of  lum¬ 
bar  spine  T-scores  between  observer  pairs,  we  determined 
regression  equations,  correlation  coefficients,  and  SE  of  the 
estimate.  Scatter  plots  and  summarized  statistics  are  shown 
in  Fig.  3.  All  correlation  coefficients  exceeded  0.88.  The 
mean  difference  (A)  and  SE  of  the  estimate  in  lumbar  spine 
T-score  among  reports  and  between  paired  observers 
ranged  from  0.05  and  0.47  (interpreters  3  and  4)  to  0.28  and 
0.63  (interpreters  1  and  4),  respectively.  The  SE  of  the  es¬ 
timate  is  a  measure  of  the  distance  between  an  average  data 
point  and  the  regression  line  and  provides  an  additional 
measure  of  agreement  among  observers. 

Despite  good  correlation  in  T-score,  review  of  Fig.  3 
shows  that,  between  interpreters,  the  final  diagnostic  cat¬ 
egory  may  differ  substantially.  Overall,  changes  in  diagnos¬ 
tic  category  ranged  from  18  (observers  3  and  4)  to  30  pa¬ 
tients  (observers  1  and  4).  For  example,  osteoporosis. 


osteopenia,  and  normal  bone  mass  were  diagnosed  by 
DXA  in  50, 66,  and  80  men  by  interpreter  1  and  54, 69,  and 
53  men  by  interpreter  4.  Among  176  men  with  T-scores 
provided  by  both  interpreters,  30  (17%)  experienced  a 
change  in  diagnostic  category  between  interpreters,  moving 
down  to  a  lower  bone  mass  category  in  all  but  six  cases  with 
interpreter  4  compared  with  interpreter  1. 

We  found  that  interobserver  disparities  in  T-scores  could 
be  attributed  largely  to  disagreement  regarding  the  pres¬ 
ence  of  FSD.  For  example,  interpreters  2  and  4  agreed  fully 
on  the  presence  of  FSD  among  four  vertebral  bodies  for  76 
reports,  and  the  mean  T-score  was  -1.68  ±  1.51  and  -1.69  ± 
1.54  for  these  reports,  respectively  (p  =  0.93  by  paired 
I- test).  Conversely,  for  the  BMD  reports  with  less  than  full 
agreement  regarding  FSD  (n  =  124),  interpreters  2  and  4 
had  significantly  different  mean  T-scores  (-1.32  ±  1.44  and 
-1.75  ±  1.28,  respectively,  p  =  0.0001).  Complete  data  are 
summarized  in  Table  2. 

DISCUSSION 

Although  bone  turnover  markers,  ultrasound,  and  quan¬ 
titative  CT  scanning  each  predict  fragility  fracture/5-105 
BMD  measured  by  DXA  remains  the  gold  standard  for 
osteoporosis  diagnosis  and  fracture  risk  assessment  in  clini¬ 
cal  practice/1-5,125  However,  DXA  is  an  imperfect  tool  for 
various  reasons  including  artifacts  such  as  degenerative  ar¬ 
thritis,  aortic  calcification,  and  scoliosis/11,125  Because 
DXA  is  widely  used  to  diagnose  and  manage  metabolic 
bone  disease,  methods  that  improve  tire  quality  of  DXA 
interpretation  will  thereby  improve  patient  care.  Indeed, 
consistent  interpretation  is  crucial  to  quality  patient  care 
and  the  conduct  of  multicenter  studies  using  BMD  end¬ 
points.  As  standardization  of  diagnostic  criteria  for  verte¬ 
bral  fracture  has  proven  beneficial  to  clinical  investigation 
and  patient  care/14-195  a  similar  benefit  may  be  anticipated 
from  successful  standardization  of  BMD  interpretation. 

The  ISCD  has  taken  an  important  first  step  in  standard¬ 
izing  BMD  interpretation,  by  developing  criteria  for  verte¬ 
bral  body  exclusion.  However,  whereas  the  exclusion  crite¬ 
ria  seem  straightforward,  we  have  shown  that  their 
application  is  problematic  and  variable,  even  among  ISCD- 
certified  physicians.  Contrary  to  our  expectation  of  good 
agreement  among  interpreters,  we  find  that  agreement  gen¬ 
erally  falls  into  the  moderate  range  and  leads  to  disagree¬ 
ment  in  diagnostic  classification.  This  disagreement  among 
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observers  is  secondary  to  differing  thresholds  for  recogniz¬ 
ing  FSD,  with  a  trend  toward  a  lower  final  T-score  among 
interpreters  with  lower  thresholds  for  noting  FSD  and  in¬ 
consistent  exclusion  of  either  the  upper  or  lower  vertebra 
from  a  discordant  pair.  Nevertheless,  despite  differences  in 
vertebral  exclusion,  a  strong  linear  relationship  between 
interpreters  is  noted  with  respect  to  lumbar  spine  T-scores. 

Determining  when  to  exclude  a  vertebral  body  because 
of  a  FSD  is  the  most  subjective  of  the  four  ISCD  exclusion 
criteria.  Thus,  it  is  not  surprising  that  the  greatest  disagree¬ 
ment  between  interpreters  was  observed  when  applying  this 
exclusion  criterion.  We  sought,  but  did  not  find,  evidence 
suggesting  that  differing  thresholds  for  FSD  are  related  to 
experience  (data  not  shown).  Moreover,  when  we  reviewed 
the  data  reported  herein,  disagreements  among  us  focused 
on  whether  FSD  were  of  sufficient  severity  to  warrant  ver¬ 
tebral  exclusion  rather  than  on  whether  a  FSD  was  present. 
Our  data  suggest  that  progress  in  standardizing  DXA  in¬ 
terpretation  therefore  will  require  establishment  of  uniform 
FSD  definitions.  Such  standardization  might  be  accom¬ 
plished  in  any  of  several  ways,  each  of  which  has  advantages 
and  disadvantages.  One  possible  approach  is  to  develop  and 
use  an  atlas  of  FSD  when  training  densitometrists.  A  sec¬ 
ond  potential  strategy- would  require  that  vertebral  body 
exclusion  because  of  FSD  be  accompanied  by  an  ancillary 
exclusion  criterion,  such  as  an  unusual  discrepancy  in  T- 
score.  Third,  plain  radiographs  could  be  used  to  confirm  an 
FSD  before  excluding  the  vertebral  body.  Finally,  software 
is  under  development  to  identify  differences  in  the  digital 
'  image  pixel  density  of  each  vertebra,  allowing  such  differ¬ 
ences  to-be  .highlighted 'as  potential  FSD  in  the  printed 
report.  Thus,  whereas  reaching  consensus  on  FSD  may  be  a 
.  difficult;  it  is  achievable  in  principle  with  adequate 
discussion  among  the  bone  community. 

.  The  other  three  ISCD  exclusion  criteria  are  based  on 
numerical  data;  the  only  subjective  decision  is  which  of  two 
“vertebral  bodies  to  exclude.  For  example,  if  there  is  a  lack 
of  increase  in  bone  area  from  Li  to  Lj,  the  interpreter  could 
choose  to  exclude  L,_  or  L2.  Such  minimal  subjectivity  is 
likely  responsible  for  the  stronger  agreement  between  ob¬ 
servers  when  applying  these  three  exclusion  criteria.  Nev¬ 
ertheless,  we  found  that  agreement  among  observers  is  bet¬ 
ter  still  if  one  corrects  for  arbitrarily  excluding  one  or  the 
other  of  a  discordant  vertebral  body  pair.  Specific  guidance 
regarding  which  vertebra  of  a  pair  should  be  excluded  is 
therefore  an  important  step  in  refining  the  vertebral  exclu¬ 
sion  criteria.  Such  guidance  could  be  incorporated  into 
bone  density  software,  which  could  in  turn  highlight  discor¬ 
dant  pairs  in  the  printed  report,  eliminating  the  possibility 
of  arithmetic  errors  and  allowing  an  interpreter  to  exclude 
vertebral  bodies  with  more  consistency. 

Our  study  has  several  important  limitations.  Only  DXA 
scans  of  predominantly  white  male  veterans  were  evalu¬ 
ated.  Reproducibility  may  be  different  among  women  and 
among  subjects  with  more  varied  ethnic  backgrounds.  An¬ 
other  limitation  of  our  study  is  the  use  of  a  single  densi¬ 
tometer  and  technician;  the  performance  characteristics  of 
different  technicians  and  instruments  may  affect  scan  per¬ 
formance.  Most  importantly,  the  data  presented  here  offer 
no  guidance  regarding  “best  practice”  in  applying  the  ISCD 


exclusion  criteria.  It  remains  an  open  question  whether  the 
exclusion  criteria  in  general,  and  the  various  thresholds  for 
recognizing  FSD  in  particular,  improve  the  ability  of  spinal 
BMD  to  predict  fracture.  Such  guidance  will  ultimately  re¬ 
quire  that  various  interpretation  schemes  be  prospectively 
related  to  incident  fractures.  Other  areas  for  future  study 
include  validation  of  our  findings  in  different  patient  groups 
and  a  more  systematic  analysis  of  reader  experience  on 
DXA  interpretation. 

We  conclude  that,  despite  their  apparent  simplicity,  the 
ISCD  criteria  for  vertebral  body  exclusion  are  difficult  to 
apply  consistently.  Differences  in  interpretation  are  caused 
by  differing  thresholds  for  FSD  and  choice  of  which  verte¬ 
bra  of  a  discordant  pair  to  exclude.  Refinement  of  exclusion 
criteria  can  overcome  both  sources  of  disagreement. 
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Abstract.  Dual  energy  X  ray  absorptiometry  (DXA)  has 
become  a  popular  analytical  technique  in  mice  and  other 
small  animals.  Comparative  study  of  bone  properties  at 
different  anatomical  sites  is  an  active  area  of  study  in 
model  organisms.  Such  investigations  require  that  site- 
specific  data  be  generated  and  interpreted.  There  are  no 
published  data  addressing  the  degree  to  which  contra¬ 
lateral  mouse  bones  resemble  each  other  in  the  absence 
of  an  experimental  intervention,  nor  are  there  data 
addressing  the  correlation  of  bone  densitometry  mea¬ 
surements  between  anatomically  distant  sites.  To  ad¬ 
dress  these  gaps  in  our  knowledge,  we  used  DXA  to 
compare  excised  mouse  femora  and  humeri.  At  the 
population  level,  left  bones  were  slightly  but  signifi¬ 
cantly  denser  than  right  bones,  with  an  overall  adjusted 
bone  mineral  density  (BMD)  difference  of  0.7  ±  0.3  and 
0.5  ±  0.2  mg/cm2  at  the  femur  and  humerus,  respec¬ 
tively.  At  the  level  of  bone  pairs  from  a  single  animal, 
absolute  adjusted  BMD  disparities  between  the  right 
and  left  sides  were  2.3  ±1.9  mg/cm2  at  the  femur  and 
1.7  ±  1.4  mg/cm2  at  the  humerus.  Correlation  coeffi¬ 
cients  between  left  and  right  sides  were  0.78  for  adjusted 
BMD  at  both  sites.  The  correlation  coefficient  between 
side-averaged  femoral  and  humeral  BMD  was  0.81,  but 
ranged  between  0.70  and  0.75  when  limited  to  ipsilateral 
or  contralateral  femur-humerus  pairs.  Our  findings 
suggest  the  desirability  of  randomizing  limbs  for  treat¬ 
ment  in  studies  using  contralateral  limb  controls.  These 
observations  may  represent  the  densitometric  manifes¬ 
tation  of  behavioral  and  neuroanatomical  lateralization 
in  laboratory  mice. 

Key  words:  Bone  density  technology  —  Bone  develop¬ 
ment  —  Osteoporosis  of  animal  models 


Dual  energy  X-ray  absorptiometry  (DXA)  has  the  most 
widely  used  imaging  methodology  to  assess  risk  of  fra¬ 
gility  fracture.  In  addition  to  its  documented  ability  to 
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stratify  patients  for  fracture  risk,  examination  is  accu¬ 
rate,  precise,  noninvasive,  and  entails  only  modest 
radiation  exposure.  Moreover,  examination  is  feasible  at 
the  lumbar  spine,  hip,  and  wrist,  corresponding  to 
common  fracture  sites.  These  features  make  DXA  an 
attractive  choice  for  diagnosing  osteoporosis,  following 
patients  to  monitor  effectiveness  of  therapy  for  osteo¬ 
porosis,  and  as  a  surrogate  endpoint  for  fracture  in 
clinical  investigation.  Correspondingly,  DXA  has  be¬ 
come  a  popular  analytical  technique  in  mice  and  other 
small  animals,  either  as  a  substitute  for  more  demanding 
techniques,  such  as  biomechanical  testing,  histomorph- 
ometry,  and  gravimetry,  or  as  an  adjunct  to  them. 

Recent  progress  in  skeletal  genetics  and  success  in 
continuing  efforts  to  develop  bone  pharmaceuticals  have 
been  achieved  in  large  part  through  studies  performed  in 
rodents.  As  small  animal  skeletal  research  continues, 
more  detailed  questions  regarding  differences  in  the 
behavior  of  cortical  and  trabecular  bone  and  at  different 
anatomical  sites  are  becoming  ever  more  prominent.  For 
example,  several  laboratories  have  devised  methods  for 
imposing  investigator-specified  loading  (e.g.  [1-8])  and 
unloading  (e.g.  [9-13])  regimens  on  a  single  limb,  using 
the  contralateral  limb  as  a  control.  Similarly,  experi¬ 
mental  investigations  of  fracture  healing  (e.g.  [14-20])  in 
small  animal  models  routinely  use  contralateral  limb 
controls.  Such  experiments  require  that  site-specific  data 
be  generated  and  interpreted. 

As  in  humans,  site-specificity  of  bone  mineral  density 
(BMD)  has  been  reported  in  mice  [21-24],  Regardless  of 
species,  site-specificity  has  been  sought  primarily  in 
comparisons  of  skeletal  sites  with  marked  differences  in 
the  proportion  of  cortical  to  trabecular  bone.  Therefore, 
little  is  known  regarding  the  extent  to  which  anatomi¬ 
cally  distant  cortical  bone  sites  resemble  each  other 
densitometrically.  Existing  rodent  site-specificity  data 
are  therefore  insufficient  to  help  guide  interpretation  of 
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comparative  densitometry  of  left  and  right  bones,  as 
there  are  no  published  data  addressing  the  degree  to 
which  contralateral  bones  are  similar  in  the  absence  of 
an  experimental  intervention.  In  order  to  address  these 
gaps  in  our  knowledge,  we  undertook  a  comparison  of 
excised  mouse  femora  and  humeri  using  DXA. 

We  report  here  that  right-left  correlations  at  each  of 
these  sites  are  high,  as  are  intersite  correlations  using 
side-averaged  data.  These  findings  demonstrate  the  im¬ 
pact  of  region  of  interest  analyses  on  measurement 
precision  relative  to  whole-body  analyses.  Moreover, 
our  experiments  revealed  that  left  bones  have  slightly 
but  significantly  greater  BMD  than  right  bones.  This 
unexpected  observation  has  practical  implications  for 
mouse  DXA  study. 


Materials  and  Methods 

Mice 

The  mice  in  this  study  were  FI  and  F2  offspring  of  a  reciprocal 
intercross  between  recombinant  congenic  strains  HcB/13Dem 
and  HcB/14Dem  [25,  26].  The  parental  strains  were  the  most 
divergent  of  24  recombinant  congenic  strains  for  failure  load 
[26],  and  the  DXA  data  reported  here  were  obtained  in  the 
course  of  a  linkage  study  for  biomechanical  performance  using 
these  strains.  In  our  original  survey  of  the  HcB  recombinant 
congenics,  HcB/13  and  HcB/14  femora  had  ash  percentages  of 
67.0  ±  0.7%  and  70.1  ±  0.9%  (mean  ±  s.d.),  respectively  [26], 
Subsequent  ashing  and  DXA  experiments  confirm  the  initial 
observation  (not  shown).  Included  were  98  males  and  101  fe¬ 
males  from  the  HcB/13  X  HcB/14  arm  and  103  males  and  102 
females  from  the  HcB/14  X  HcB/13  arm  of  the  cross.  Both  FI 
and  F2  animals  were  included,  and  both  sexes  were  pooled  for 
analysis.  Animals  were  housed  at  2-5  animals/500  cm2  cage  on 
corncob  bedding,  fed  laboratory  rodent  chow  5001  (PMI 
Nutrition  International,  Richmond,  IN)  and  tap  water  ad  lib 
and  exposed  to  12  hour  light/ 12  hour  dark  cycle.  Animals  were 
sacrificed  at  17  ±  1  weeks  of  age  by  C02  asphyxiation. 
Femora,  humeri,  and  radii  were  dissected  free  of  soft  tissue 
immediately  following  sacrifice,  wrapped  in  phosphate-buf¬ 
fered  saline-saturated  gauze,  and  stored  at  -70°C  until  anal¬ 
ysis.  The  animal  protocol  was  approved  by  the  University  of 
Wisconsin  and  the  William  S.  Middleton  Veterans’  Hospital 
IACUCs. 

DXA  Scanning 

Dissected  bones  were  scanned  using  the  PIXImus  (GE  Lunar, 
Madison,  WI)  densitometer  and  software  version  1.45.023.  We 
scanned  a  manufacturer-provided  phantom  daily  during  data 
collection  to  confirm  the  instrument’s  calibration  and  preci¬ 
sion.  Bones  were  allowed  to  equilibrate  to  room  temperature 
prior  to  scanning.  All  bones  from  a  single  animal  were 
examined  in  the  same  scan.  Bones  were  positioned  manually  in 
air  on  a  7  mm  thick  Plexiglas  platform  provided  by  the 
instrument  manufacturer  to  achieve  similar  orientations  for 
right  and  left  bones.  Acquisition  time  per  scan  was  approxi¬ 
mately  5  minutes.  The  primary  analysis  was  performed  using 
software  version  1.45.023.  We  manually  adjusted  regions  of 
interest  to  analyze  each  of  the  bones  individually,  excluding 
any  bones  with  edges  that  were  incorrectly  identified  by  the 
software.  We  adjusted  for  bone  position  by  multiple  linear 
regression  as  described  [27],  Briefly,  the  adjustment  algorithm 
was  developed  by  scanning  a  reference  set  of  bones  at  varied 
positions  within  the  PIXImus  field  prior  to  ashing.  BMD  was 


then  modeled  by  stepwise  multiple  linear  regression  and  the 
final  model  included  mineral  content  by  ashing,  X-coordinate 
within  the  scanning  field,  and  Y-coordinate  within  the  scan¬ 
ning  field  as  significant  independent  variables,  while  distance 
from  the  center  of  the  scanning  field  was  not  a  significant 
independent  variable.  For  the  adjustment  used  here  (software 
version  1 .45  in  air),  the  overall  R 2  value  is  0.95  and  the  A  R1 
obtained  by  adjusting  for  X  and  Y  coordinates  are  0.027  and 
0.003,  respectively. 

Statistics 

Statistical  calculations  were  performed  using  SigmaStat  for 
Windows,  version  2.03  (SPSS,  Chicago,  IL).  Accrued  data 
were  tested  for  normality  by  the  Kolmogorov-Smirnov  test. 
Comparisons  between  left  and  right  sides  were  made  by  paired 
Student’s  t  test.  Categorical  comparisons  were  performed  by 
X2.  Comparisons  between  different  skeletal  sites  were  analyzed 
by  Pearson  product  moment  correlation  or  Spearman’s  rank 
order  correlation.  Significance  level  was  P  <  0.05  and  adjusted 
by  Bonferroni’s  correction  to  account  for  multiple  compar¬ 
isons.  Data  are  reported  as  mean  ±  standard  deviation  unless 
noted  otherwise. 

Results 

Data  obtained  from  403  animals  were  included  in  this 
study,  including  401  pairs  of  femora,  and  394  pairs  of 
humeri.  The  number  of  pairs  of  bones  studied  at  each 
site  is  less  than  the  total  number  of  animals  because  of 
breakage  or  loss.  Descriptive  statistics  of  BMD,  BMC, 
and  bone  area  revealed  that  while  adjusted  BMD  and 
BMC  are  normally  distributed  at  both  skeletal  sites, 
bone  area  is  skewed  at  both  ( P  <  10-5),  with  a  greater 
fraction  of  the  distribution  having  larger  than  modal 
bone  areas  regardless  of  side,  sex,  or  cross  arm. 

Left-sided  Bones  are  Denser  Than  Right  sided  Bones 

At  both  sites,  BMD  was  significantly  higher  for  the  left 
than  for  the  right  side  (Table  1).  The  side  disparities  did 
not  arise  as  a  result  of  anomalous  data  from  a  relatively 
small  number  of  bones,  but  reflect  a  population-wide 
pattern.  We  compared  the  number  of  mice  in  which  the 
left  value  was  greater  and  the  number  of  mice  in  which 
the  right  value  was  greater  for  each  measurement  and 
each  site  and  found  significantly  more  left  values  to  be 
greater  for  BMD  at  both  sites.  Left-right  disparities  of 
BMC  mirrored  the  BMD  findings,  while  bone  area  did 
not  differ  significantly  between  the  left  and  right  sides  at 
either  site  (not  shown). 

Intersite  BMD  Correlations 

Scatterplots  relating  left  and  right  BMD  at  the  femur 
and  humerus  are  shown  in  Figure  1.  These  data  show 
equal,  moderately  strong  correlations  of  0.78  ( P  <  10-5) 
between  sides  at  both  sites. 

Our  original  goal  in  performing  DXA  on  several  long 
bones  of  the  same  animals  was  to  determine  how  highly 
correlated  BMD  was  at  the  various  sites.  These  corre- 


28 


G.  E.  Lopez  Franco  et  al.:  DXA  of  ex  vivo  Mouse  Long  Bones 


T-able  1.  Comparison  of  DXA  measurements  of  left  and  right  bones  at  2  sites 


Femur 

Humerus 

Left 

Right 

Left 

Right 

Adjusted  BMD  (mg/cm2) 

Mean  A  (mg/cm2) 

Mean  Abs  A  (mg/cm2) 

57.0  ±  4.4 

P  <  10“5* 

0.7  ±  0.3 

2.3  ±1.9 

56.3  ±  4.2 

42.2  ±  3.4 

P  =  3  x  10~5 

0.5  ±  0.2 

1.7  ±  1.4 

41.7  ±  3.3 

#  greater + 

256 

P  <  10“3** 

155 

240 

P  =  1.5  x  10-5** 

154 

Mean  A  =  mean  (difference  between  left  and  right  bones) 

Mean  Abs  A  =  mean  absolute  value  (difference  between  left  and  right  bones) 

*Paired  t  test 

**%2,  1  degree  of  freedom 

+  Equal  values  for  both  sides  counted  as  0.5  for  left  and  0.5  for  right 


Fig. 


Corrected  Left  Femoral  BMD  (mg/cm2)  Corrected  Left  Humeral  BMD  (mg/cm  ) 

1.  Scatterplots  of  left  and  right  BMD.  Left,  femur;  right,  humerus.  Individual  data  points  are  adjusted  BMD. 


lations  for  individual  bones  and  side-averaged  values  are 
summarized  in  Table  2  and  Figure  2. 

The  correlations  between  femur  and  humerus  are 
nearly  as  strong  as  those  between  left  and  right  bones  at 
a  single  site. 

Discussion 

Animal  models  allow  investigators  to  address  a  variety 
of  experimental  questions  that  could  not  be  studied,  or 
could  be  studied  only  with  great  difficulty  in  human 
subjects.  Many  of  these  feature  experimental  interven¬ 
tion  in  one  limb  and  use  of  the  contralateral  limb  as  a 
control.  DXA  scanning  in  these  experiments  serves  a 
variety  of  related  functions.  One  such  is  to  use  BMD  or 
BMC  as  a  surrogate  measurement  for  more  technically 
demanding  methods  such  as  gravimetry.  Another  is  to 


Table  2.  Intersite  correlation  of  BMD 


Femur/Humerus 

L/L 

L/R 

R/L 

R/R 

Ave/Ave 

BMD 

0.75 

0.70 

0.71 

0.71 

0.80 

Femoral  side  given  first,  L  =  left,  R  =  right,  Ave  =  average, 
P  <  10“5  for  all  correlations,  Pearson  product-moment  cor¬ 
relation 


evaluate  an  explanatory  factor  when  biomechanical 
testing  is  performed.  A  third  is  to  assess  longitudinal 
changes  in  a  single  animal.  Each  of  these  applications 
requires  understanding  of  both  the  performance  char¬ 
acteristics  of  DXA  scanning  and  the  underlying  biology 
of  the  effects  being  sought.  We  have  addressed  the  for- 
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Corrected  Left  Femoral  BMD  (mg/cm2) 


Corrected  Left  Femoral  BMD  (mg/cm2) 


t' 


Corrected  Right  Femoral  BMD  (mg/cm2) 


Corrected  Right  Femoral  BMD  (mg/cm2) 


Average  Corrected  Femoral  BMD  (mg/cm2) 


Fig.  2.  Scatterplots  of  intersite  BMD.  A  Left  femur  v  left  humerus;  B.  Left  femur  v  right  humerus,  C.  Right  femur  v  left  humerus, 
D.  Right  femur  v  right  humerus,  E.  Average  femur  v  average  humerus.  Individual  data  points  are  adjusted  BMD. 


mer  elsewhere  [27]  and  address  the  latter  here.  In  a  large 
experimental  sample  we  have  found  small,  but  signifi¬ 
cant  differences  in  BMD  between  the  left  and  right  sides 
at  both  the  femur  and  the  humerus.  The  average  ad¬ 
justed  BMD  differences  between  sides  are  approximately 
2  mg/cm2  at  both  sites.  We  report  both  the  average 
difference  between  left  and  right  bones  and  the  absolute 
value  of  the  difference  because  the  absolute  value  of  the 
difference  is  relevant  in  interpreting  data  obtained  from 
paired  limbs,  as  in  studies  in  which  one  limb  is  treated 
while  the  contralateral  limb  serves  as  an  untreated 
control.  Further,  we  report  that  correlation  between 
sides  was  moderately  high  at  0.78,  and  only  slightly 
poorer  between  sites. 

The  magnitude  of  the  absolute  right/left  difference  is 
comparable  with  that  arising  from  biologically  impor¬ 
tant  interventions  [28-30]).  The  implication  is  that 
studies  in  which  intervention  is  applied  to  one  limb  with 
the  contralateral  limb  serving  as  a  control  should  be 
randomized  for  whether  the  right  or  left  limb  is  treated. 
This  is  at  variance  with  usual  practice  of  assigning  either 
the  left  or  right  limb  for  intervention  in  all  animals.  The 
additional  burden  to  investigators  is  small,  yet  protects 
against  a  potentially  significant  source  of  experimental 
bias.  Similarly,  in  longitudinal  studies  assessing  regions 


of  interest,  care  should  be  taken  to  compare  the  same 
side  at  each  measurement  in  each  animal.  That  said,  it  is 
important  to  note  that  left-right  side  disparities  have  not 
yet  been  documented  in  most  mouse  strains  or  in  other 
small  animal  species  used  for  limb  comparison  experi¬ 
ments.  Similarly,  our  study  was  limited  to  data  acquired 
by  DXA;  we  have  not  rigorously  proven  that  either  mi- 
cro-CT  scanning  or  histomorphometry  would  reveal  left- 
right  disparities.  In  the  absence  of  such  data,  we  consider 
it  prudent  to  advise  randomization  of  limbs  until  data 
showing  the  absence  of  side  disparities  become  available. 

While  our  data  represent,  to  the  best  of  our  knowl¬ 
edge,  the  first  skeletal  evidence  of  hemispheric  domi¬ 
nance  in  mice,  the  issue  of  dominance  has  been  addressed 
in  different  contexts.  Others  have  investigated  laterali¬ 
zation  in  mice  using  behavioral  [31-38]  and  neuroan- 
atomic  [39,  40]  assays.  These  earlier  studies  all  support 
the  interpretation  that  our  BMD  findings  reflect  the 
animals’  true  biology,  and  suggest  that  a  majority  of 
them  are  left-dominant.  Taken  together,  these  observa¬ 
tions  support  the  notion  that  mice  display  lateralization, 
but  that  little  is  known  regarding  either  the  most  reliable 
methods  of  measuring  it  or  the  mechanisms  underlying  it. 

We  observed  only  a  moderately  high  correlation  of 
0.78  between  left  and  right  BMDs  at  either  the  femur  or 
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the  humerus,  much  lower  than  the  correlation  observed 
between  right  and  left  hips  in  humans  [41—46].  In  addi¬ 
tion,  we  obtained  higher  intersite  correlations  between 
side-averaged  data  than  for  either  ipsilateral  or  contra¬ 
lateral  comparisons.  Lastly,  we  found  that  bone  area  is 
not  normally  distributed  in  our  sample.  Each  of  these 
findings  reflects  the  relatively  large  pixel  size  (0.18  mm  x 
0.18  mm)  relative  to  the  bones  studied,  as  we  have  dis¬ 
cussed  elsewhere  in  greater  detail  [27].  Averages  obtained 
from  2-paired  measurements  are  expected  to  be  closer  to 
the  true  values  than  either  measurement  taken  individ¬ 
ually.  That  side-averaged  data  are  more  highly  correlated 
between  anatomic  sites  than  ipsilateral  data  and  that  the 
correlations  of  ipsilateral  and  contralateral  intersite  data 
are  equivalent  demonstrates  that  the  magnitude  of  indi¬ 
vidual  measurement  errors  is  large  relative  to  the  differ¬ 
ences  between  right-and  left-sided  data.  Comparison  of 
unilateral  and  bilateral  femur  BMD  measurement  in 
humans  has  led  to  a  similar  interpretation  [44,  47],  Our 
reported  intersite  correlations  are  only  slightly  poorer 
than  right/left  correlations  at  a  single  anatomic  site, 
suggesting  that  femoral  BMD  is  highly  predictive  of 
humeral  BMD  and  vice  versa,  supporting  the  notion  that 
the  various  long  bones  share  similar  biology. 

We  were  unable  to  find  any  prior  reports  of  mor¬ 
phologic  asymmetry  of  DXA  traits  in  mice.  In  contrast, 
multiple  human  investigations  have  compared  densito- 
metric  features  of  bones  and  related  the  observations  to 
known  or  hypothesized  differences  in  mechanical  load¬ 
ing.  Some  investigators  did  not  relate  side  comparisons 
to  handedness  [42,  44],  while  those  who  did  disagree 
about  the  relationship  of  handedness  to  BMD  [43,  45] 
and  [48-50]. 

In  addition  to  the  caveats  noted  above,  our  study  has 
several  further  limitations,  which  readers  should  bear  in 
mind.  The  mice  studied  here  derive  approximately  3/4  of 
their  genome  from  C3H/DiSnA  and  the  other  1/4  of 
their  genome  from  C57BL/10ScSnA. 

Our  results  may  not  be  generalizable  to  other  strains 
of  mice,  let  alone  to  other  species.  DXA  scanning  was 
performed  ex  vivo  in  air,  with  bones  positioned  on  a 
specimen  tray  provided  by  the  instrument  manufacturer, 
and  were  studied  individually.  The  PIXImus  instrument 
and  software  are  designed  for  optimal  performance  in 
scanning  intact  animals  for  total  body  BMD.  The  PIX- 
mus  instrument  we  used  demonstrated  a  significant, 
systematic  error  that  depended  on  specimen  position  [27]. 

In  spite  of  these  limitations  and  adjustment  for  them, 
the  large  number  of  specimens  allowed  us  to  reveal  a 
small  but  significant  difference  in  right  and  left  long 
bone  BMD.  In  this  regard,  the  data  are  similar  in  nature 
to  those  obtained  in  clinical  studies  of  blood  pressure.  In 
blood  pressure  studies,  each  individual  measurement  is 
subject  to  error  of  a  magnitude  that  usually  exceeds  the 
treatment  effect.  However,  the  treatment  effect  can  still 
be  detected  with  a  sufficient  sample  size.  Indeed,  power 


calculations  using  our  reported  precision  data  for  ex  vivo 
PIXImus  densitometry  [27]  suggest  that  our  sample’s 
power  to  detect  BMD  disparities  of  the  observed  mag¬ 
nitude  approaches  unity.  Given  the  minor  additional 
effort  required  to  assimilate  our  findings  into  the  design 
of  future  studies,  we  believe  that  our  data  provide  a 
biological  basis  for  preferring  that  single-limb  inter¬ 
ventions  in  mice  be  randomized  prospectively. 
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Abstract 

Dentinogenesis  imperfecta  (DI)  is  a  common  but  variable  feature  of  osteogenesis  imperfecta  (OI).  The  ColIa2°'m  mutation  ( oim )  is  a 
well-studied  mouse  model  of  chain  deficiency  OI.  Heterozygous  oimh  mice  have  subtle  skeletal  fragility,  while  homozygous  oimloim  mice 
have  marked  skeletal  fragility.  To  further  define  the  consequences  of  oim  mutation,  we  examined  teeth  by  light  and  scanning  electron 
microscopy  (SEM).  The  dental  phenotype  in  Colla2oim  (oim)  mice  is  more  severe  in  incisors  than  in  molars  and  includes  changes  in  pulp 
chamber  size,  tooth  shape,  and  dentin  ultrastructure.  Teeth  in  oimloim  animals  are  clinically  fragile,  while  oiml+  teeth  are  grossly  normal. 
Incisor  pulp  chamber  areas  (in  pm2)  are:  upper  +/+  =  358  ±  75,  lower  +/+  =  671  ±  162,  upper  oim/+  =161  ±  54,  lower  oim/+  =  156  ±  19, 
upper  oimloim  =  6900  ±  1040,  and  lower  oimloim  =  66  ±  62  (P  <  10-s).  Incisor  non-pulp  chamber  cross-sectional  areas  (in  pm2),  reflecting 
dentin  areas,  are:  upper +/+  =  39,000  ±  1670,  lower +/+  =  35,600  ±  1980,  upper  oim/+  =  47,500  ±2510,  lower  oim/+  =  26,000  ±  1830,  upper 
oimloim  =  29,800  +  315,  and  lower  oimloim  =  36,800  ±  3450  (P  <  ICT5)-  Ultrastructural  abnormalities  are  more  pronounced  in  incisors  than 
in  molars  and  depend  on  dosage  of  the  mutant  allele.  These  include  reduction  in  the  number  and  regularity  of  spacing  of  the  dentinal  tubules, 
lesser  mineralization,  and  blurring  of  the  boundary  between  peritubular  and  intertubular  dentin.  Our  findings  demonstrate  that  both  oimloim 
and  oim/+  mice  suffer  from  DI.  The  more  severe  incisor  phenotype  may  reflect  incisors’  continuous  growth. 

©  2005  Elsevier  Inc.  All  rights  reserved. 
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Introduction  '  J* 

\ 

Osteogenesis  imperfecta  (OI),  also  known  as  brittle  bone 
disease,  is  an  inherited  clinical  syndrome  marked  by  skeletal 
fragility  arid  associated  abnormalities  (reviewed  by  [1-3]). 

Nearly  all  cases  are  caused  by  mutations  of  the  genes 
encoding  type  I  collagen.  In  the  most  severe  cases,  OI  is 
lethal  in  utero,  while  in  the  mildest  cases  it  is  subclinical 
[4-6].  The  associated  abnormalities  include  dentinogenesis 
imperfecta  (DI),  skin  fragility,  hearing  loss,  scleral  discol¬ 
oration,  and  other  manifestations  of  connective  tissue 
fragility.  These  varied  clinical  manifestations  reflect  the 
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abundance  of  type  I  collagen  in  various  connective  tissues, 
including  teeth,  skin,  ligaments,  fasciae,  and  tendons. 

Classical  features  of  DI  include  discoloration  of  teeth, 
increased  tooth  fragility,  and  obliteration  of  the  pulp 
chambers.  Histological  and  ultrastructural  abnormalities 
reported  in  human  DI  include  a  decreased  number  of 
irregularly  shaped  dental  tubules,  remnants  of  capillary 
inclusions,  obliterated  pulps,  morphological  abnormalities 
of  dental  collagen  fibers,  and  mineralization  defects  [7-10]. 

Several  murine  models  of  OI  exist,  including  the 
Colla2°'m  (oim)  mutation.  This  mutation  is  a  frameshift 
of  the  gene  encoding  the  a2  chain  of  type  I  collagen, 
resulting  in  quantitatively  deficient  a2(I)  chain  protein 
synthesis  [11].  The  resulting  phenotype  is  obvious  in  oim/ 
oim  homozygotes,  which  display  impaired  growth  and 
marked  skeletal  fragility  [11],  Phenotypic  abnormalities 
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54  are  far  more  subtle  in  oim/+  heterozygotes,  but  are  never- 

55  theless  apparent  by  biomechanical  testing  [12,13].  Bio- 

56  chemical  manifestations  of  the  mutation  include  reduced 

57  collagen  cross-linking  [14],  reduced  apatite  crystallinity 

58  [15],  and  increased  proline  hydroxylation  [14], 

59  While  considerable  attention  has  been  devoted  to 

60  characterizing  bones  in  oim/oim  and  oim/+  mice,  the  teeth 

61  remain  poorly  characterized  in  mutants.  To  address  this  gap 

62  in  our  understanding  of  the  consequences  of  oim  mutation, 

63  we  examined  teeth  in  mice  of  all  3  Collal  genotypes  by 

64  light  and  scanning  electron  microscopy  (SEM).  We  report 

65  that  both  oim/+  and  oim/oim  display  abnormal  features  that 

66  are  similar  to  those  described  in  human  DI,  the  phenotype 

67  depends  on  both  mutant  allele  dosage  and  the  specific  tooth 

68  examined.  In  particular,  we  report  the  heterozygous  pheno- 

69  type  is  more  pronounced  in  incisors  than  in  molars  and  that 

70  upper  and  lower  incisors  display  different  abnormalities  of 

71  pulp  chamber  size  and  morphology. 
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86  Light  microscopy 

\ 

87  Whole  teeth  were  examined  using  a  Leica  DM-LB 

88  microscope  at  5x  magnification.  Images  were  stored 

89  digitally  as  TIFF  files  and  labeled  using  Photoshop  5.0  LE 

90  (Adobe  Systems,  San  Jose,  CA). 

91  SEM 

92  In  order  to  examine  pulp  chamber  size  using  SEM,  we 

93  fractured  teeth  at  the  level  of  the  junctional  epithelium. 

94  Samples  were  air  dried,  sputter  coated  with  platinum- 

95  palladium,  and  imaged  with  a  Hitachi  S570  SEM  with 

96  Gatan  Digital  Capture  System.  To  analyze  the  surface  of 

97  specimens  over  a  wide  range  of  magnifications,  we  scanned 

98  across  the  surface  of  the  specimen  using  a  Hitachi  S570 

99  SEM  with  Gatan  Digital  Capture  System.  For  light  micro- 

100  scopy,  images  were  saved  in  TIFF  format  and  labeled  with 

101  Photoshop  5.0  LE. 


Materials  and  methods 

Mice 

Mice  used  in  this  study  were  male  and  female  offspring 
of  oim/+  parents  and  were  genotyped  at  Colla2  at  weaning 
by  direct  sequencing  [16].  Animals  were  maintained  in  a  12- 
h  light/dark  photo  period,  were  fed  powdered,  irradiated 
PICO  5058  rodent  chow  (Purina,  St.  Louis,  MO)  and 
autoclaved  tap  water  ad  lib,  and  housed  in  groups  of  up  to  5 
mice/500  cm2  cage  between  weaning  and  sacrifice  at  11 
weeks  of  age.  Following  sacrifice  by  C02  asphyxiation  at  1 1 
weeks  of  age,  heads  were  stored  frozen  until  teeth  were 
dissected  free  of  alveolar  bone  for  study.  This  work  satisfied 
the  Hospital  for  Special  Surgery’s  requirements  for  the 
ethical  use  of  laboratory  research  animals. 

JV 


Image  processing 

We  used  SigmaScan  Pro  version  5.0  (Jandel  Scientific)  to 
manually  select  the  non-pulp  chamber  and  pulp  chamber 
regions  of  interest  and  calculate  their  areas. 

Statistics 


At  least  3  mice  of  each  genotype  were  examined  at  each 
site.  Pulp  chamber  and  non-pulp  areas  are  expressed  as 
mean  +  standard  deviation.  We  used  SigmaStat  version  2.03 
(SPSS)  to  perform  statistical  analyses.  Two-factor  ANOVA 
(genotype,  3  strata;  and  location,  2  strata)  was  performed 
following  square  root  transformation  of  the  raw  data.  Post 
hoc  comparisons  were  performed  using  Tukey’s  test. 


Results 


Clinical  tooth  phenotype  in  oim/oim  and  oim/+  mice 


Tooth  fragility  is  a  prominent  feature  of  the  oim/oim 
phenotype.  Teeth  break  spontaneously  if  animals  are  fed  a 
pelleted  diet.  In  order  to  prevent  this  problem,  our  standard 
husbandry  practices  include  provision  of  powdered  chow  to 
oim/oim  and  oim/+  animals.  In  our  experience,  oim/oim 
upper  incisors  uniformly  display  brownish  discoloration 
relative  to  both  +/+  and  oim/+  teeth,  but  there  is  no  obvious 
abnormality  of  intact  teeth  either  from  other  sites  in  11- 
week-old  oim/oim  animals  or  in  any  of  the  teeth  of  oim/+ 
animals,  as  shown  in  Fig.  1.  In  addition,  no  phenotypic 
differences  were  observed  between  males  and  females  for 
any  of  the  genotypes. 


Genotype  and  position-dependent  variation  of  incisor 
morphology 


While  all  3  Colla2  genotypes  have  similar  lower  incisor 
morphology,  oim/oim  upper  incisors  display  a  distinctive 
triangular  cross-section,  as  demonstrated  in  Fig.  2.  Similarly 
to  lower  incisors,  there  is  no  alteration  of  molar  morphology 
across  Colla2  genotypes  (data  not  shown). 

In  addition  to  overall  tooth  morphology,  Fig.  2  also 
reveals  notable  genotype  and  position-dependent  differences 
of  pulp  chamber  size  in  incisors.  Pulp  chamber  areas  in  pm2 
are  as  follows:  upper  +/+  =  358  ±  75,  lower +/+  =  671  ±  162, 
upper  oim/+  =  1 6 1  ±  54,  lower  oim/+  =  1 56  ±19,  upper  oim/ 
oim  =  6900  ±  1040,  and  lower  oim/oim  =  66  ±  62.  These 
values  differ  significantly  by  2-factor  ANOVA,  as  summar¬ 
ized  in  Table  1. 

The  non-pulp  chamber  cross-sectional  area  of  the 
incisors  also  demonstrates  genotype  and  position-depend¬ 
ent  differences.  Non-pulp  chamber  cross-sectional  area 
reflects  dentin  area,  as  all  teeth  exhibit  only  a  thin  outer 
layer  of  enamel.  Non-pulp  chamber  areas  in  are  as  follows: 
upper  +/+  =  39,000  +  1670,  lower  +/+  =  35,600  ±  1980, 
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Fig.  1.  Transillumination  of  upper  and  lower  incisors.  Images  represent  a  medial  proximal  view  of  the  light  passing  through  the  tooth.  Panels  A  to  C  are 
from  upper  incisors.  Panels  D  to  F  are  from  lower  incisors.  Panels  A  and  D  illustrate  +/+,  B  and  E  illustrate  oiml+,  and  panel  C  illustrates  oimloim.  Scale 
bar  =  500  pm. 

-gjp  ' 

149  upper  oiml+  =  47,500  ±  2510,  lower  oim/+  =  26,000  ±  f  ANOVA,  as  summarized  in  Table  1.  In  contrast  to  pulp 

150  1830,  upper  oimloim  =  29,800  ±315,  and  lower  oimloim  =  ’  chamber  area,  where  genotype  has  a  strong  impact,  genotype 

151  36,800  ±  3450.  These  values  differ  significantly  by  2-factor  has  only  borderline  significance  for  non-pulp  area.  However, 

/•  \. 

+/+  +/oim  oimloim 


152 

153 

154 


Fig.  2.  SEM  of  tooth  fracture  surfaces.  Images  represent  a  cross-sectional  fracture  in  upper  and  lower  incisors  at  the  level  of  the  junctional  epithelium.  Panels  A 
to  C  show  upper  incisors  and  panels  D  to  F  show  lower  incisors.  Teeth  are  oriented  with  the  vestibular  surface  at  the  top  and  the  lingual  surface  at  the  bottom. 
Panels  A  and  D  show  +/+,  panels  B  and  E  show  oiml+,  and  panels  C  and  F  show  oimloim.  Scale  bar  =  500  pm. 


4 


C.E.  Lopez  Franco  et  al.  /  Bone  xx  (2005)  xxx-xxx 


tl.l 

tl.2 

Table  1 

Incisor  cross-sectional 

areas 

tl.3 

Pulp  chamber 
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Overall 

Genotype 

P  <  10“5 

tl.5 

Position 

P  <  1(T5 

tl.6 

Interaction 

P  <  10“5 

tl.7 

Within  upper 

+/+  vs.  oim/+ 

NS 

tl.8 

+/+  vs.  oim/oim 

P=  1.9  x  !0“4 
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oim/+  vs.  oim/oim 

P  =  1.9  x  10“4 
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Within  lower 

+/+  vs.  oim/+ 
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tl.ll 

+/+  vs.  oim/oim 

0.0037 

tl .  12 

oim/+  vs.  oim/oim 

NS 
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Within  +/+ 

Upper  vs.  Lower 

P  =  1.7  x  10~4 

tl .  14 

Within  oim/+ 

Upper  vs.  Lower 

NS 

tl.15 

Within  oim/oim 

Upper  vs.  Lower 

P=  1.7  x  10”4 

tl.10 

tl.17 

Non-pulp  area 

tl.  18 

Overall 

Genotype 

P  =  0.041 

tl.10 

Position 

P  =  3.1  I0~4 

tl.20 

Interaction 

P  <  10“5 

tl.21 

Within  upper 

+/+  vs.  oim/+ 

P  =  0.0070 

tl  .22 

+/+  vs.  oim/oim 

P  =  0.0018 

tl.23 

oiml+  vs.  oim/oim 

P  =  1.9  x  10“4 

tl.24 

Within  lower 

+/+  vs.  oim/+ 

P  =  8.6  x  10”4 

tl.25 

+/+  vs.  oim/oim 

NS 

tl.26 

oim/+  vs.  oim/oim 

P  =  4.5  x  10^4 

tl.27 

Within  +/+ 

Upper  vs.  Lower 

NS 

t.1.28 

Within  oim/+ 

Upper  vs.  Lower 

P=  1.7  x  10“4 

t.1.29 

Within  oim/oim 

Upper  vs.  Lower 

P  0.0045  J- 

155  the  genotype  x  location  interaction  is  highly  significant  for 

156  both  pulp  and  non-pulp  areas. 


«ij.s  A'.- 

157  Disturbed  dentin  ultrastructure  in  oim/oim  and  oim/+ 

/- '  >.  x 

158  As  shown  in  Fig.  3,  SEM  demonstrates  that  compared  to 

159  wild-type  animals,  both  oim/oim  and  oim/+  strains  display  a 

160  reduction  in  the  density  and  regularity  of  dentinal  tubules  in 

161  the  matrix  adjacent  to  the  pulp  chamber.  The  reduction  in 

162  dentinal  tubule  density  is  dependent  on  mutant  allele 

163  dosage,  with  oim/+  animals  displaying  a  phenotype 


intermediate  between  those  of  the  +/+  and  oim/oim 
homozygotes. 

Abnormal  ultrastructural  features  are  also  apparent  closer 
to  the  dentino-enamel  junction,  as  shown  in  Figs.  4  and  5, 
obtained  from  incisors  and  molars,  respectively.  Fligh-power 
images  of  incisor  dentin  reveal  that  the  odontoblast 
processes  of  +/+  teeth  are  regularly  spaced  and  uniformly 
invested  with  accreted  mineral  (Fig.  4G).  Moreover,  fracture 
of  the  peritubular  and  intertubular  dentin  occurs  in  2 
discrete,  well-defined  planes.  Teeth 'from  oim/+  heterozy¬ 
gotes  (Fig.  4H)  have  lesser  amounts  of  mineral  adherent  to 
the  odontoblast  processes,  allowing  visualization  of  protein 
fibrils.  While  the  oim/+  fracture  plarie  is  undulated,  there  is 
a  less  obvious  distinction  between  the  peritubular  and 
intertubular  dentin  fracture  planes.  In  oim/oim  homozygotes 
(Fig.  41),  protein  fibrils  are  abundant  and  it  is  difficult  to 
distinguish  between  peritubular  and  intertubular  regions. 

The  dental  phenotypes  described  above  are  more 
pronounce'cl  in  incisors  (Fig.  4)  than  in  molars  (Fig.  5).  In 
contrast,  ultrastructural  abnormalities  are  equally  evident  in 
all  dim/oim  teeth.  Because  of  their  fragility,  we  were  unable 
to  fracture  oim/oim  molars  in  a  single  plane,  our  attempts  to 
,  do  so  resulting  in  a  secondary  fracture  perpendicular  to  the 
enamel-cementum  junction  (Fig.  5C).  That  this  failure  is 
limited  to  the  molars  may  simply  be  a  consequence  of  the 
different  tooth  shapes.  Incisors’  greater  ellipticity  results  in 
the  sufficiency  of  a  relatively  small  force  to  achieve  fracture 
in  the  direction  of  the  short  axis.  Relatively  larger  forces  are 
needed  to  fracture  the  less  elliptical  molars,  and  the  applied 
forces  result  in  both  bending  and  crushing. 

Phenotypic  variability  in  heterozygotes 

There  is  also  heterogeneity  among  oim/+  animals 
studied  at  the  same  site.  Fig.  6  illustrates  variability  of 
pulp  chamber  size  in  oim/+  upper  incisors.  Expression  of 
the  other  structural  abnormalities  arising  from  the  hetero- 
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Fig.  3.  SEM  of  dentin  adjacent  to  the  pulp  chamber.  Panel  A  shows  +/+,  B  shows  oim/+,  and  panel  C  shows  oim/oim.  Pulp  chambers  are  at  the  lower  center  of 
each  panel.  Scale  bar  =  5  pm. 


Fig.  4.  SEM  of  lower  incisors.  Panels  A,  D,  and  G  show  +/+,  panels  B,  E,  and  H  show  oim/+,  and  panels  C,  F,  and  1  show  oim/oim.  From  right  to  left, 
orientation  is  vestibule-lingual.  Rectangles  in  panels  A-C  indicate  fields  shown  in  panels  D-F  and  ovals  in  panels  D-F  indicate  fields  shown  in  panels  G-I. 
Scale  bars  are  500  pm  in  panels  A-C,  50  pm  in  panels  D-F,  and  5  pm  in  panels  G-I. 
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199  zygous  presence  of  the  oim  mutation  is  als6  variable  (data 

200  not  shown).  . }>  % 


201  Discussion  _  \ 

f  ' 

202  The  data  shown  heie  demonstrate  that  both  oim/oim  and 

203  oiml+  exhibit  dentaii  abnormalities.  These  include  altered 

204  pulp  chamber  size,  reduced  numbers  of  dental  tubules,  and 

205  abnormalities  of  dentin  mineralization,  corresponding  to  the 

206  abnormalities  encountered  in  human  DI.  These  features  are 

207  generally  more  apparent  in  incisors  than  in  molars.  The 

208  murine  oim  system  therefore  can  serve  as  a  model  of  type 

209  Ial  DI,  in  which  DI  is  a  manifestation  of  primary  collagen 

210  disease.  The  dental  phenotype  of  oim  resembles  the  skeletal 

211  phenotype  insofar  as  its  severity  is  dependent  on  the  number 

212  of  mutant  alleles  present.  Abnormalities  are  obvious  in  oim/ 

213  oim  homozygotes  in  upper  and  lower  incisors,  and  in  oim/+ 

214  heterozygotes  in  upper  incisors.  It  is  worth  noting  that  in 

215  humans  with  OI,  ultrastructura!  abnormalities  have  been 

216  found  in  the  teeth  even  in  the  absence  of  clinically  apparent 

217  DI  [8,9],  The  presence  of  a  microscopic  DI  phenotype  in 

218  oim/+  mice  is  consistent  with  this  observation  in  humans. 

219  We  found  that  the  incisor  non-pulp  cross-sectional  area  is 

220  not  simply  related  to  dosage  of  the  mutant  allele,  but 


displays  a  strong  genotype  x  position  interaction.  While  the  221 
oim/+  dentin  area  is  reduced  relative  to  +/+  in  lower  222 
incisors,  it  is  increased  in  upper  incisors.  The  oim/oim  223 

dentin  area  is  reduced  relative  to  +/+  in  both  upper  and  224 

lower  incisors,  but  is  greater  than  that  of  oim/+.  The  pattern  225 
of  dentin  area  is  difficult  to  reconcile  with  a  simple,  mutant  226 
allele  dose-dependent  reduction  of  a2  chain  synthesis.  Any  227 
of  several  distinct  mechanisms  may  contribute  to  the  228 
observed  changes,  including  alteration  of  odontoblast  229 
numbers,  alteration  of  synthetic  activity  within  individual  230 
odontoblasts,  and  alteration  of  dentin  matrix  composition.  231 
Distinguishing  among  these  mechanisms  is  an  important  232 
task  for  future  work.  233 

The  greater  severity  of  the  oim/+  phenotype  in  incisors  234 
compared  to  molars  is  another  intriguing  finding.  One  235 
possible  explanation  for  this  observation  is  that  in  mice,  236 

incisors  grow  throughout  life,  while  molars  do  not.  It  is  237 

therefore  possible  that  in  molars,  slower  growth  is  able  to  238 
provide  some  compensation  for  a  reduced  type  I  collagen  239 
biosynthesis,  while  in  incisors  the  developmental  program  240 
of  continued  growth  and  the  persistent  loading  demands  241 
alter  such  an  adaptation.  242 

In  both  humans  and  mice,  DI  is  an  inconstant  OI  243 
manifestation.  While  DI  is  present  in  oim,  it  is  absent  in  244 
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( Mov-13 ).  This  mutation  is  a  retroviral 
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Fig.  5.  Scanning  electron  micrographs  of  (upper  first)  molars.  Panels  A,  D,  and  G  show  +/+,  panels  B,  E,  and  H  show  oim/+,  and  panels  C,  F,  and  I  show  oim/ 
oim.  Rectangles  in  panels  A-C  indicate  fields  shown  in  panels  D-F  and  ovals  in  panels  D-F  indicate  fields  shown  in  panels  G-I.  Scale  bars  are  500  pm  in 
panels  A-C,  50  pm  in  panels  D-F,  and  5  pm  in  panels  G-I.  * 


oim/oim 


+/oim 


oim/oim 


G.E.  Lopez  Franco  el  al.  /  Bone  xx  (2005)  xxx-xxx 


1 


246  insertion  into  the  gene  encoding  otl(I),  resulting  in 

247  decreased  transcriptional  activity  [17-19],  Mov-13/Mov- 

248  13  homozygotes  die  in  utero  due  to  rupture  of  the  great 

249  vessels  [20,21],  while  heterozygotes  have  skeletal  fragility 

250  [22-25],  However,  the  expression  level  is  apparently 

251  normal  in  teeth,  so  that  Mov-13/+  mice  have  normal  teeth 

252  [26],  To  the  best  of  our  knowledge,  dental  phenotypes  have 

253  not  been  reported  in  other  murine  01  models  [27-30], 

254  There  are  important  limitations  to  the  work  reported  here. 

255  First,  we  have  assessed  dental  phenotypes  in  oim/+  and  oim / 

256  oim  mice  at  only  a  single  age,  1 1  weeks.  It  is  possible  that 

257  both  additional  abnormalities  and  additional  phenotypic 

258  variability  would  be  apparent  if  teeth  were  studied  over  a 

259  range  of  ages.  In  human  01,  for  example,  DI  is  more 

260  prevalent  in  primary  teeth  than  in  permanent  teeth.  In  two 

261  separate  studies  of  dentin  from  mature,  1-year-old  oim/oim 

262  mice,  molars  [31]  and  incisors  [32]  were  found  to  exhibit 

263  less  dramatic  mineralization  alterations,  in  contrast  to  the 

264  findings  in  the  current  study.  Second,  oim  is  maintained  on 

265  an  outbred  B6C3  background,  deriving  its  alleles  ultimately 

266  from  either  C57BL/6J  or  C3HeB/J.  The  dental  phenotype 

267  might  differ  if  the  mutation  was  to  be  transferred  to  a 

268  different  genetic  background.  Moreover,  segregation  of 

269  parental  alleles  may  contribute  to  the  phenotypic  variability 

270  observed  in  oiml+  heterozygotes.  Third,  our  description  of 

271  the  tooth  phenotype  in  oim  mice  is  limited  to  features 

272  apparent  by  light  or  scanning  electron  microscopy.  In 

273  previously  mentioned  studies  on  1 -year-old  teeth,  the 

274  mineral  composition  in  oim  mice  was  assessed  by  neutron 

275  activation  analysis  [32]  and  by  FTIR  spectroscopy  [31], 

276  techniques  that  probe  the  chemical  and  molecular  structure 

277  of  the  mineral.  It  is  possible  that  further  investigation  of  the 

278  teeth  from  growing  animals  with  Such  techniques  would 

279  help  elucidate  the  mechanisms  by  which  deficiency  of  a2(I) 

280  leads  to  a  cascade  of  higher-order  dentin  defects. 

281  In  summary,  we  have  shown  that  both  oim  / oim  and  oim/+ 

282  display  dentinogenesis’  imperfecta.  DI  in  homozygous 

283  mutants  is  manifested  grossly  by  tooth  fragility.  Both 

284  heterozygotes  and  homozygous  mutants  have  significant 

285  dental  abnormalities.  However,  both  homozygotes  and 

286  heterozygotes  demonstrate  alterations  of  the  pulp  chamber 

287  and  decreased  numbers  of  dental  tubules.  As  in  humans, 

288  microscopy  is  more  sensitive  than  clinical  examination  in 

289  detecting  DI  arising  in  the  context  of  OI.  The  oim  model  may 

290  prove  useful  for  investigating  potential  pharmacological 

291  interventions  for  mild  forms  of  DI  and  for  better 

292  understanding  of  collagen’s  role  in  mineralization. 
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INTRODUCTION 


Abnormalities  of  structural  bone  proteins  have  mechanical  consequences.  These 
have  been  most  thoroughly  characterized  in  animal  models  that  allow  performance  of 
biomechanical  testing  under  controlled  conditions.  However,  even  in  the  context  of 
animal  models,  most  mechanical  investigations  have  been  conducted  at  the  whole  bone 
level  rather  than  at  the  tissue  level. 

Osteogenesis  imperfecta  is  a  family  of  disorders  caused  by  mutations  of  the 
genes  encoding  type  I  collagen,  the  major  structural  protein  in  the  bone  matrix.  Clinical 
consequences  of  01  vary  markedly,  ranging  from  lethal  disease  to  subtle  defects  that 
approach  the  limits  of  detection.  At  the  macroscopic  level,  many  of  the  abnormalities 
resulting  from  01  are  related  to  deficient  bone  mass  and  abnormal  architecture, 
complicating  determination  of  the  specific  contribution  that  tissue  level  properties  make 
to  whole  bone  strength.  Moreover,  physiological  adaptation  to  the  presence  of  an 
abnormal  matrix  protein  during  growth  and  development  may  obscure  the  tissue  level 
consequences  of  the  mutation. 

Mice  harboring  the  Col1a2oim  mutation  ( oim )  are  a  well-studied  model  of  01. 
This  mutation  is  a  frameshift  of  the  gene  encoding  the  a2  chain  of  type  I  collagen, 
resulting  in  quantitatively  deficient  a2(l)  chain-synthesis.  The  resulting  phenotype  is 
obvious  in  oim/oim  homozygotes,  which  display  impaired  growth  and  marked  skeletal 
fragility.  Phenotypic  abnormalities  are  far  subtler  in  oim/+  heterozygotes,  but  are 
nevertheless  apparent  by  biomechanical  testing.  Biochemical  manifestations  of  the 
mutation  include  reduced  collagen  cross-linking,  reduced  apatite  crystallinity,  and 
increased  proline  hydroxylation. 

In  order  to  better  characterize  the  tissue-level  mechanical  consequences  of  the 
oim  mutation,  we  performed  nanoindentaton  of  unembedded  cortical  bone  in  C57BL/6J 
and  B6C3FeLe  a/a  with  various  Col1a2  genotypes  sacrificed  at  11  and  17+1  week  of 
age.  We  report  that  Col1a2  genotype,  genetic  background,  age  and  possibly  specimen 
orientation  each  affect  tissue-level  mechanical  performance  as  assessed  by 
nanoindentation  interpreted  by  2  analytical  methods.  Further,  Atomic  Force  Microscopy 


(AFM)  revealed  heterogeneous,  site  specific  differences  in  topographical  roughness 
among  Col1a2  genotypes  compatible  with  the  existence  of  focal  areas  of  impaired 
mineralization.  Moreover,  examination  of  unembedded  samples  allowed  us  to  analyze 
compliance,  providing  additional  insight  into  bone’s  mechanical  behavior. 


MATERIAL  AND  METHODS 


Mice:  Animals  were  housed  2  to  5/500  cm2  cage  on  corncob  bedding,  fed  powdered, 
irradiated  PICO  5058  or  PMI  5001  laboratory  rodent  chow  and  tap  water  ad  lib  and 
exposed  to  12  hour  light/12  hour  dark  cycle.  Mice  used  for  bone  analyses  were 
sacrificed  at  11  and  17  +  1  week  of  age  and  bone  specimens  obtained  for  study.  Bones 
were  stored  frozen  until  thawed  for  preparation. 

Sample  preparation:  We  prepared  femoral  or  tibial  diaphyseal  specimens  for  study  in 
either  the  longitudinal  (polished  approximately  parallel  to  the  bone’s  long  axis)  or  cross 
(cut  in  cross-section  and  polished  approximately  perpendicular  to  the  bone’s  long  axis) 
orientation.  Samples  were  polished  sequentially  with  100,  50,  20,  10,  5,  and  3  pm 
diamond  cloth,  sonicated  in  water  for  10-15  sec  at  47  Hz  to  remove  adherent  debris, 
and  mounted  on  stainless  steel  discs  with  superglue.  Samples  were  prepared  and 
studied  without  embedding. 

Imaging:  Scanning  electron  microscopy  (SEM)  images  were  acquired  with  a  Hitachi 
S570  at  10  kV  and  7  mm  working  distance  after  platinum  sputter  coating.  AFM  images 
were  obtained  with  the  Triboscope  (Hysitron,  Minneapolis,  MN)  nanoindentation  system 
prior  to  and  following  indentation  testing.  SEM  and  AFM  images  were  used  to  measure 
the  area  of  indentation  and  assess  material  pileup.  Indent  dimensions  were  determined 
by  using  AFM  and  SEM  images  and  indent  area  was  calculated  from  the  side  lengths  by 
Heron’s  formula  {Press,  1978  #15}. 

Nanoindentation:  We  performed  nanoindentation  using  a  Triboscope  system.  Loading, 
holding  and  unloading  segments  described  the  force  profile  applied  to  the  sample  by 
using  a  diamond  Berkovich  tip  at  loads  between  3000  and  8000pN.  We  performed  a 
minimum  of  13  indents  per  bone.  Prior  to  testing  bone  samples,  a  series  of  indents  in 
fused  quartz  was  performed  to  determine  the  machine  compliance  and  calibrate  the 
area  function.  Data  were  analyzed  using  the  Oliver  and  Pharr  method  as  implemented 
by  the  Hysitron  instrument  software  and  manually  by  an  elasticity  analysis  as  described 
previously  (13).  Meyer  hardness  was  calculated  as  H  =  L/A,  with  L  =  maximum  load 
and  A  =  area  of  indentation.  Young’s  Modulus  was  calculated  from  reduced  Young’s 


Modulus  (Er)  assuming  a  specimen  Poisson  ratio  =  0.3  (4).  Specimen  compliance  and 
tip  properties  were  determined  by  using  the  Jo  parameter,  as  previously  described  (13). 
Jo  =  H/E2eff,  where  E  is  the  effective  modulus,  determined  jointly  by  the  Young’s  moduli 
of  the  sample  and  indenter,  the  Poisson  ratios  of  the  sample  and  indenter,  and  a 
constant  determined  by  the  geometry  of  the  indenter  tip.  This  treatment  allows  the 
measured  unloading  compliance,  C,  to  be  resolved  into  the  sum  of  the  machine 
compliance,  Cm  and  the  sample  compliance,  C0.  By  plotting  L1/2  v  L1/2C,  for  indents 
performed  at  varying  maximum  loads,  one  obtains  a  regression  line  whose  intercept  is 
J01/2  and  whose  slope  is  Cm.  The  region  between  95%  and  20%  of  the  unloading 
portion  of  the  curve  is  used  to  determine  the  slope. 

Statistical  analysis:  Data  are  reported  as  mean  +  standard  deviation  unless  otherwise 
noted.  Comparison  among  ages,  genotypes,  and  orientations  was  performed  by 
ANOVA  with  post  hoc  application  of  Tukey’s  test. 


RESULTS 


ATOMIC  FORCE  MICROSCOPY  IMAGES  SUGGEST  FOCAL  MINERALIZATION 
IMPAIRMENT. 

AFM  images  showed  in  figure  1  illustrate  some  of  the  qualitative  topographical 
differences  observed  among  Col1a2  genotypes.  In  the  +/+  image,  no  fibers  are 
apparent.  In  the  oim/+  image,  some  fibers  can  be  identified,  while  fibers  are  prominent 
in  the  oim/oim  image.  Other  areas  of  oim/+  and  oim/oim  cortical  bone  revealed  a 
topography  more  similar  to  that  of  the  wild  type,  but  we  did  not  encounter  any  fibrous 
regions  in  the  +/+  bone  (not  shown).  We  interpret  these  observations  as  suggesting  the 
presence  of  focal  mineralization  impairment  in  the  oim/+  and  oim/oim  tissue. 

REDUCED  YOUNG’S  MODULUS  (ER)  AND  HARDNESS  (H)  (GPA)  FROM 
TRIBOSCOPE  SOFTWARE  AND  FROM  COMPLIANCE  CORRECTION. 

We  analyzed  the  nanoindentation  data  in  2  ways.  First,  we  used  the  Triboscope 
instrument  software  to  implement  analysis  by  the  Oliver  and  Pharr  method.  Second,  we 
performed  a  manual  compliance  analysis.  Separate  statistical  analyses  of  the  effects  of 
genotype,  age,  and  specimen  orientation  were  performed  for  each  analysis. 

Figure  2  shows  a  typical  indent  and  load-displacement  curve.  Table  1 
summarizes  the  hardness  data  and  Table  2  the  Young’s  modulus  data. 

Both  methods  find  that  hardness  is  greater  in  the  longitudinal  than  in  the  cross 
sample  orientation  (P  <  10'5).  At  17  weeks,  there  is  also  a  significant  effect  of  genotype 
(P  =  0.009),  but  there  is  no  significant  interaction  between  genotype  and  orientation. 
When  comparing  oim/+  and  +/+  genotypes  at  11  and  17  weeks,  there  are  significant 
effect  of  age  (P  <  10'5)  and  genotype  (P  =  0.007)  with  a  marginally  significant  interaction 
between  them  (P  =  0.02).  Limiting  comparisons  to  the  3  Col1a2  genotypes  at  11 
weeks,  there  is  a  significant  difference  between  oim/oim  and  the  other  2  genotypes  by 
manual  analysis  (P  =  2  X  10'5)  and  **  by  software  analysis  (insert  stats). 

When  comparing  genotype  and  age,  there  is  a  statistically  significant  difference 
(P  =  <0.001)  in  the  mean  values  among  the  different  levels  of  Age  as  well  as  in  the 
mean  values  among  the  different  levels  of  Genotype  (P  =  0.008). 


but  the  magnitude  of  the  difference  is  greater  according  to  the  instrument  software 
analysis.  The  instrument  software  finds  increasing  hardness  as  the  dosage  of  oim 
alleles  increases  in  1 1  week  old  animals,  while  the  manual  analysis  finds  that  at  this  age 
there  is  no  difference  between  the  +/+  and  oim/+  heterozygotes,  with  both  having  lower 
H  than  the  oim/oim  homozygotes.  .  There  was  no  genotype-dependent  difference  in 
longitudinal  hardness  at  17  weeks  in  B6C3  animals,  but  both  tested  genotypes  were 
harder  than  C57BL/6  by  software  analysis  (insert  stats),  while  only  oim/+  was  harder 
than  C57BL/6  by  manual  analysis  (insert  stats).  For  those  genotypes  studied  at  both  1 1 
and  17  ages,  hardness  increased  with  age  (insert  stats),  but  once  again  the  magnitude 
of  the  age  effect  was  greater  by  the  software  analysis  than  the  manual  analysis. 

For  Young’s  modulus,  manual  analysis  gave  consistently  higher  values  than 
those  calculated  by  the  instrument  software.  The  instrument  software  finds  that  for  the 
B6C3  mice,  regardless  of  Col1a2  genotype  have  a  greater  Young’s  modulus  in  the 
longitudinal  orientation  than  in  the  cross  orientation  (insert  stats),  while  there  is  no 
orientation-dependent  difference  in  C57BL/6.  In  contrast,  the  manual  analysis  finds 
greater  modulus  in  the  longitudinal  orientation  for  both  the  C57BL/6  and  B6C3  strains 
(insert  stats).  The  analyses  differ  most  dramatically  in  assessing  the  impact  of  Col1a2 
genotype  at  11  weeks  of  age.  Instrument  software  finds  that  oim/+  and  oim/oim  have 
lower  Young’s  modulus  than  +/+  (insert  statistics).  Manual  analysis  leads  to  oim/oim 
mice  having  a  significantly  greater  Young’s  modulus  than  either  +/+  or  oim/+  animals 
(show  stats).  Both  analyses  show  an  increase  in  Young’s  modulus  between  11  and  17 
weeks  (show  stats). 

COMPLIANCE  IS  RELATED  TO  CORTICAL  THICKNESS  IN  LONGITUDINAL 
ORIENTATION 

Figure  3  shows  the  J0-based  compliance  analysis  of  bones  tested  in  the 
longitudinal  orientation.  It  reveals  that  in  general,  there  is  a  genotype-dependent 
difference  in  system  compliance,  as  indicated  by  the  slope  of  the  various  regression 
lines.  Bone  from  +/+  bones  show  essentially  0  slope,  while  oim/oim  bones  show  a 
slope  of  approximately  ***(plug  in  number).  Heterozygous  bones  span  the  entire  range 


from  ***  to  Since  machine  compliance  is  uniform  across  specimens,  the  differences  in 
system  compliance  represent  differences  in  sample  compliance.  At  the  right  of  figure  3, 
we  illustrate  schematically  our  interpretation  of  the  origin  of  the  observed  differences  in 
specimen  compliance.  While  +/+  bones  have  thick  cortices  in  relation  to  overall  bone 
diameter,  oim/oim  bones  have  a  much  smaller  cortical  thickness  and  oim/+  bones  have 
cortical  thicknesses  that  span  a  wide  range  of  values.  Unfortunately,  we  did  not 
measure  our  specimens  prior  to  nanoindentation  testing,  but  figure  4  shows  a  SEM 
image  of  fractured  +/+  and  oim/+  femora  illustrating  the  extent  of  variation  observed  in 
actual  specimens. 

Figure  5  shows  the  time-depth-based  creep  analysis  for  the  three  Col1a2 
genotypes  and  reveals  that  +/+,  oim/+  and  oim/oim  bones  behaved  identically. 


DISCUSSION 

The  experiments  reported  here  highlight  some  Coi1a2  genotype-dependent 
features  of  bone  tissue  and  illustrate  the  importance  of  nanoindentation  protocol  on 
interpretation  of  the  data.  In  particular,  we  demonstrate  the  importance  of  embedding 
and  specimen  orientation. 

AFM  scanning  of  polished  bone  surfaces  reveal  focal  areas  in  which 
mineralization  appears  to  be  reduced.  This  is  not  a  uniform  feature  of  the  bone  matrix, 
as  many  areas  examined  in  the  oim/+  and  oim/oim  specimens  look  no  different  from  +/+ 
bone.  However,  we  did  not  observe  any  apparently  undermineralized  fields  in  any  of 
the  +/+  specimens  studied. 

We  note  substantial  differences  between  the  Young’s  modulus  values  obtained 
by  the  software  and  manual  analyses.  The  manual  analysis  gives  greater  values  of  E, 
and  the  magnitude  of  the  discrepancy  between  the  2  methods  is  greater  in  the 
longitudinal  orientation.  The  compliance  analysis  allows  us  to  explain  these  apparent 
discrepancies.  The  software  analysis  doesn’t  account  for  specimen  compliance,  while 
the  manual  analysis  explicitly  measures  it.  In  the  longitudinal  specimen  orientation, 
indentation  occurs  in  a  region  of  bone  that  is  unsupported  by  an  embedding  substrate  or 
by  a  continuous  column  of  tissue.  The  specimen  can  therefore  deform  in  response  to 


the  force  imparted  by  the  indenter.  The  deformation  is  reflected  in  the  specimen’s 
compliance,  which  generally  increases  in  relation  to  the  specimen  diameter  and 
decreases  in  relation  to  cortical  thickness.  By  accounting  for  sample  deformation,  the 
manual  analysis  yields  an  increased  E.  This  explanation  accounts  for  the  greater 
impact  of  the  analysis  method  in  the  longitudinal  orientation  than  in  the  cross 
orientation,  and  in  specimens  with  greater  compliance  than  in  specimens  with  lesser 
compliance.  That  specimen  compliance  is  considered  in  the  manual  analysis  leads  us 
to  consider  the  moduli  obtained  in  this  way  as  more  reliable  than  those  obtained  by  the 
instrument  software. 

Lesser  discrepancies  are  observed  between  the  hardness  values  obtained  by  the 
2  methods.  These  reflect  the  areas  used  in  calculating  hardness.  For  the  manual 
analysis,  area  was  determined  on  the  basis  of  imaging,  while  the  area  function  was 
used  to  determine  area  in  the  software  analysis.  Because  the  manual  analysis  is  based 
on  a  direct  measurement,  it  should  be  considered  more  reliable. 

Compliance  analysis  of  the  sort  performed  here  is  possible  regardless  of  the 
methods  used  to  prepare  the  specimens,  but  its  use  with  unembedded  bone  specimens 
allows  the  application  of  nanoindentation  methods  to  a  novel  problem.  While  past 
nanoindentation  studies  of  bone  have  only  sought  to  study  hardness  and  Young’s 
modulus,  we  show  here  that  a  bone’s  compliance  behavior  is  also  amenable  to  study. 
While  the  experiments  reported  here  are  insufficient  to  describe  the  precise 
relationships  between  bone  tissue  material  properties,  specimen  geometry,  and 
compliance,  fuller  elucidation  of  these  is  a  topic  worthy  of  further  study. 

With  regard  to  the  effects  of  genotype,  age,  and  orientation  on  hardness  and 
Young’s  modulus,  we  find  that  Young’s  modulus  and  hardness  increase  with  age  in  both 
+/+  and  oim/+  bones  tested  in  the  longitudinal  orientation.  We  find  that  these  2 
genotypes  do  not  differ  significantly  from  each  other  for  Young’s  modulus,  but  that 
hardness  is  marginally  greater  (double  check  for  significance)  in  the  oim/+  bones.  We 
find  that  Young’s  modulus  is  greater  in  oim/oim  bone  than  either  of  the  other  2  Col1a2 
genotypes  in  11  week  old  specimens  tested  in  the  longitudinal  orientation.  Lastly,  we 
find  that  both  Young’s  modulus  and  hardness  are  greater  in  the  longitudinal  than  the 
cross  orientation. 


These  findings  can  be  compared  to  past  studies  of  biomechanical  performance  in 
the  Col1a20im  system,  (insert  highlights  of  past  work  here  need  to  include  studies  by 
Camacho,  King,  Landis,  Fratzl.  Also  include  your  tooth  data). 

Our  findings  in  the  Col1a2oim  system  can  also  be  considered  in  relation  to  other 
investigations  of  tissue-level  biomechanical  properties  in  other  forms  of  01.  It  is 
unsurprising  that  we  observe  increases  in  both  hardness  and  Young’s  modulus  between 
11  and  17  weeks.  At  11  weeks,  mice  are  still  growing,  while  17  weeks  is  considered  to 
be  the  age  at  which  skeletal  maturity  is  reached  in  wild  type  mice  (Beamer  Bone  1995 
or  1996).  In  human  01,  frequent  fractures  in  childhood,  relative  sparing  during  young 
adulthood  and  middle  age,  and  increased  fracture  frequency  in  the  later  years  is  a 
frequently  observed  clinical  pattern  (Spotila  1994  and  perhaps  other  references  within 
that  one). 

There  are  several  limitations  for  the  work  presented  here.  One  is  that 
we  did  not  obtain  continuous  indentation  values  for  all  groups,  which 
precluded  manual  calculation  of  Young's  Modulus  and  hardness  for  some 
genotypes.  Another  limitation  is  that  we  did  not  have  samples  for  oim/oim 
at  17  weeks  of  age,  or  C57BL/6  at  11  weeks.  As  a  consequence,  we  were 
not  able  to  report  those  results.  A  third  limitation  of  our  study  is  that  our 
nanoindentation  apparatus  is  not  housed  in  a  controlled  humidity 
environment.  This  limited  our  ability  to  ensure  that  the  hydration  status  of 
all  specimens  was  strictly  equivalent.  A  fourth  limitation  is  that 
nanoindentation  is  insensitive  to  anisotropy.  Thus,  although  our  data 
suggest  the  presence  of  anisotropic  biomechanical  behavior,  mirroring  the 
known  anisotropy  of  bone's  molecular  components,  it  is  likely  that  the 
impact  of  tissue  orientation  on  mechanical  performance  has  been 
underestimated  here.  Finally,  it  is  important  to  recognize  that  analysis  of 
unembedded  specimens  has  magnified  the  differences  between  software  and 
manual  analysis  of  the  data. 

Regardless  of  the  limitations,  we  were  able  to  present  a  new  approach 
for  a  deeper  understanding  of  bone  mechanics  in  chain  deficiency 


osteogenesis  imperfecta.  Our  work  can  be  utilized  to  further  explore  whole 
bone  mechanical  failure  in  other  diseases  such  as  osteoporosis.  Application 
of  nanoindentation  may  provide  insight  regarding  the  mechanical 
consequences  or  other  mutations,  drug  treatments,  and  husbandry 
conditions. 
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ore  1.  AFM  topographical  images  of  bone  for  the  three  Colla2  genotypes.  +/+  is 
xated  in  panels  A  and  D,  aim/  +  in  panels  B  and  E  and  cdm/oim  in  panels  C  and  F. 
Scale  bar  far  A-C  corresponds  to  lpm.  Scale  bar  for  D-F  corresponds  to  400nm 
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Table  1.  Hardness  (GPa)  According  to  Age,  Orientation,  and  Analysis  Method 
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Table  2.  Young’s  Modulus  (GPa)  According  to  Age,  Orientation,  and  Analysis 

Method 


J0  Correlation  for  colla2oim  Genotypes 
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Correlation  for  col1a2oim  Genotypes.  +/+  is  represented  on  black,  oim/+  < 
oim/oim  is  represented  on  black.  Circles  in  the  right  illustrate  scheme  of 
compliance  effects  in  +/+  and  oim/oim 
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Figure  4.  SEM  Images  of  Femoral  Fracture  Surfaces  in  +/+  (A)  and  oiml+  (B) 
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Figure  5.  Representation  of  creep  from  time  vs  depth  data  for  the  three  Col1a2  genotypes 


